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Abstract 

(N; 

^ , It is shown that the problem of derandomizing Noether's Normahzation Lemma (NNL) 

■ in a strong form for the ring of invariants of the adjoint action of the general linear group 
Q,^ . on a tuple of matrices can be brought down from EXPSPACE (where it was earlier) to 

' P SPACE unconditionally, to PH assuming the Generalized Riemann Hypothesis (GRH), 

, and to P assuming the black-box derandomization hypothesis for symbolic trace identity 

■ testing (STIT) or equivalently symbolic determinant identity testing (SDIT). This deran- 
^SI . domization problem lies at the heart of the wild problem of classifying tuples of matrices to 

■ which the problem of classifying representations of any (finite dimensional) algebra or quiver 
^ , can be reduced. By the result here, black-box derandomization of STIT implies an explicit 

• '~j ■ solution to a coarser version of this classification problem, namely, the problem of explicitly 

r> , parametrizing and decomposing the semi-simple tuples (i.e. tuples with closed orbits), where 

■ the phrase explicit has a precise complexity theoretic meaning. Variants of the result are 
also shown assuming, instead of the black-box derandomization hypothesis, arithmetic lower 
bounds for constant depth circuits, or Boolean lower bounds for constant-depth threshold 
circuits or uniform Boolean conjectures in conjunction with GRH. 

It is also shown that Noether's Normalization Lemma for the ring of invariants for any 
finite dimensional rational representation of the special linear group of fixed dimension can 
be quasi-derandomized in a strong form unconditionally, thereby bringing this problem from 
EXPSPACE (where it was earlier) to quasi-DET C quasi-NC. 

A key ingredient of the proofs is to show that the varieties associated with the invariant 
rings under consideration are explicit, as per the general notion of explicit varieties intro- 
duced in this paper. It is shown that a strengthened form of the black-box derandomization 
hypothesis for polynomial identity testing (PIT) is equivalent to the problem of derandom- 
izing Noether's Normalization Lemma in a strict form for general explicit varieties. It is also 
shown that a strengthened form of the black-box derandomization hypothesis for depth three 
arithmetic circuits is equivalent to the problem of derandomizing Noether's Normalization 
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Lemma in a strict form for the higher order secant variety of the Chow variety. Finding 
exphcit defining equations for this variety is a long-standing problem of invariant theory. 

These and other results in this paper may explain in a unified way why proving lower 
bounds or derandomization results for arithmetic circuits of even depth three or constant- 
depth Boolean threshold circuits, or proving uniform Boolean conjectures without relativiz- 
able proofs has turned out to be so hard, and also why the classification problems of invariant 
theory and algebraic geometry have turned out to be so hard from the complexity-theoretic 
perspective. Thus this investigation reveals that the fundamental problems of Geometry 
(classification) and Complexity Theory (lower bounds and derandomization) share a com- 
mon root difficulty that lies at the junction of these two fields. We call it the GCT chasm. 

1 Introduction 

Noether's Normalization Lemma (NNL), proved by Hilbert |H12j . is the basis of a large number 
of foundational results in algebraic geometry such as Hilbert's Nullstellansatz. It also lies at the 
heart of the foundational classification problem of algebraic geometry. For any projective variety 
X C P[K^) of dimension n, where K is an algebraically closed field of characteristic zero and 
P{K^) is the projective space associated with K^^ the lemma says that any random (generic) 
linear map 'ijj-.K^-f K"", for any m > n + 1, is regular (well defined) on X. Furthermore, for 
any such ^|), ip{X) C P[K"^), the image of X, is closed in P{K"^), and (2) the fibre 'ip~^{p), for 
any point p € ip{X), is a finite set. In the context of the main results of this paper, k will be 
exponential in n and m will be polynomial in n. In this case Noether's Normalization Lemma 
expresses the variety X, embedded in the ambient space P{K^) of exponential dimension, as a 
finite cover of the variety tp{X), embedded in the ambient space P{K'^) of polynomial dimension. 
This is its main significance from the complexity-theoretic perspective. By derandomization of 
Nother's Normalization Lemma we mean deterministic construction of ip. We also refer to this 
problem as NNL in short. It turns out to be very difficult in general. First, the number of 
random bits used by the existing algorithms for general X is polynomial in k, the dimension of 
the ambient space containing X. Since k is exponential in n, the number of random bits used 
is thus exponential in n. Second, for general X as above, the current best algorithms for even 
deterministic verification of V') let alone construction, based on a recent fundamental advance 
|MR2] in Grobner basis theory, take in the worst case space that is polynomial in k and time 
that is exponential in k. The space bound for deterministic verification as well as construction 
is thus exponential in n and the time bound is double exponential in n. (Before |MR2j the time 
bound was double exponential in /c, and hence, triple exponential in n.) Nothing better can be 
expected for general varieties because |MMt IMRlj also prove a matching lower bound for the 
computation of Grobner basis in this general setting. 

We show that a stronger form of NNL that arises in the context of the problem of classify- 
ing tuples of matrices under the adjoint (simultaneous conjugate) action of the general linear 
group can be brought down from EXPSPACE, where it was earlier for the reasons above, 
to P SPACE unconditionally, to PH assuming the Generalized Riemann Hypothesis (GRH), 
and even further to P assuming the black-box derandomization hypothesis for symbolic trace 
identity testing (STIT) or equivalently symbolic determinant identity testing (SDIT). We spec- 
ify the underlying variety using a succinct specification of bit-length linear in the dimension of 
the variety. The space and time bounds are in terms of the bit-length of the succinct specifi- 
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cation. The classification problem mentioned above is the well-known wild |Dzl IDW] problem 
of invariant theory, also called an "impossible" or "hopeless" problem, to which the problem of 
classifying representations of any (finitely dimensional) algebra or quiver can be reduced. By the 
result here, black-box derandomization of STIT implies an explicit solution to a coarser version 
of this classification problem, namely, the problem of explicitly parametrizing and decomposing 
the semi-simple tuples (i.e. tuples with closed orbits), where the phrase explicit has a precise 
complexity theoretic meaning. Variants of this result are also shown assuming, instead of the 
black-box derandomization hypothesis, arithmetic lower bounds for constant depth circuits, or 
Boolean lower bounds for constant-depth threshold circuits or uniform Boolean conjectures in 
conjunction with GRH. 

We also show that Noether's Normalization Lemma for the ring of invariants for any fi- 
nite dimensional rational representation of the special linear group of fixed dimension can be 
quasi-derandomized in a strong form (unconditionally) , thereby bringing this problem from EX- 
PSPACE (where it was earlier for the same reasons) to quasi-DET C quasi-NC. This ring was 
the focus of Hubert's paper |H12| mentioned above. Noether's Normalization Lemma was, in 
fact, proved therein to show that this ring is finitely generated. 

A key ingredient in the proofs is to show that the varieties under consideration are explicit, 
as per the general notion of explicit varieties (Definition 110. 2p formulated in this paper. We 
show that a strengthened form of the black-box derandomization hypothesis for polynomial 
identity testing (PIT) is equivalent to the problem of derandomizing Noether's Normalization 
Lemma in a strict form for general explicit varieties. We also show that a strengthened form 
of the black-box derandomization hypothesis for depth three arithmetic circuits is equivalent to 
the problem of derandomizing Noether's Normalization Lemma in a strict form for the higher 
order secant variety of the Chow variety. Finding explicit defining equations for this variety is 
a long-standing problem of invariant theory. 

These and related results may explain in a unified way why proving lower bounds or deran- 
domization results for arithmetic circuits in characteristic zero of even depth three or constant- 
depth Boolean threshold circuits, or proving uniform Boolean conjectures such as EXP^^ ^ 

^ ^2 turned out to be so hard, and also why the classification problems of invariant 
theory and algebraic geometry have turned out to be so hard from the complexity-theoretic 
perspective. Thus this investigation reveals that the fundamental problems of Geometry (clas- 
sification) and Complexity Theory (lower bounds and derandomization) share a common root 
difficulty that lies at the junction of these two fields. We call it the GCT chasm. 

On the negative side, the equivalence in this article says that black-box derandomization 
of PIT in characteristic zero would necessarily require proving, either directly or by implica- 
tion, results in algebraic geometry that seem impossible to prove on the basis of the current 
knowledge. On the positive side, it says that if the fundamental derandomization and lower 
bound conjectures of complexity theory are true, then a large class of fundamental problems in 
algebraic geometry which appear intractable are actually tractable in the complexity-theoretic 
sense. 

This article belongs to a series of articles on the GCT approach to the fundamental problems 
of complexity theory. See |Mu2j for an informal overview of the earlier articles in this series, and 
|Mu3j for a formal overview. Preliminary versions of the results in this article were announced 
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in |Mu4] . We now state the main results in this article in more detail. 



1.1 The ring of matrix invariants 

Let K be an algebraically closed field of characteristic zero. By black-box derandomization of 
PIT over we mean the problem of constructing an explicit hitting set |IWt IHSt IKI^ Ag 



IRShl ISVl IDSYj against all circuits over K and on r variables with size < s. By the size of a 
circuit, we mean the total number of edges in it. There is no restriction on the bit-lengths of 
the constants in the circuit. By an explicit hitting set, we mean a poly(s)-time-constructible set 
Sr,s ^ of test inputs such that, for every circuit C on K and r- variables x = (xi, . . . , x^) with 
size < s, and C{x) not identically zero, Sr^s contains a test input h such that C(6) 7^ 0. Here 
C{x) denotes the polynomial computed by C. The fundamental black-box derandomization 
hypothesis in complexity theory |HS1 IIWl IKH |Ag IRShl ISVl IDSYj is that such explicit hitting 
sets exist. Black-box derandomization of PIT is important, because by the fundamental hardness 
vs. randomness principle |IWt IHSl IKH "Kg it is essentially equivalent to proving circuit lower 



bounds for EXP, which are much easier variants of the nonuniform P vs. NP problem. 

In our first result, we consider a restricted form of PIT called symbolic trace identity testing 
(STIT). It is equivalent [MP[ Sp , up to polynomial factors, to the PIT for arithmetic branching 



programs or weakly skew straight-line programs, and also to the symbolic determinant identity 
testing (SDIT); cf. Section EH 

By a symbolic trace over we mean a polynomial of the form trace(^(x)'), where A{x) 
is an m X m symbolic matrix whose each entry is a linear function over K in the variables 
X = (xi,...,Xr). By black-box derandomization of STIT over we mean the problem of 
constructing an explicit (poly(m, r)-time computable) hitting set Sr,m of test inputs in such 
that, for any symbolic trace polynomial trace(^(x)'), I < m, that is not identically zero, there 
exists a test input b € Sr^m such that trace(^(6)') 7^ 0. 

Let Mm{K) be the space of m x m matrices over and V = MmiKy , the direct sum of r 
copies of Mm{K), with the adjoint (simultaneous conjugate) action of G = SLm{K): 



(^1, . . . , A) ^ {PAiP-\ PArP-'), (!) 

where Ai, . . . , Ar S Mm{K) and P £ SLm{K). Let Ui, . . . ,Ur be variable mxm matrices. 
Then the coordinate ring K[V] of V can be identified with the ring K[Ui, . . . , Ur] generated by 
the variable entries of C/j's. Let i4r[y]'^ C K[V] be the ring of invariants with respect to the 
adjoint action. By an invariant we mean a polynomial f{U\, . . . , Ur) in the variable entries of 
C/j's such that 

/(C/l, . . . , Ur) = f{PUiP-\ PUrP^'), 

for all P G SLm{K). Let n = dim.{V) = rm?. It is known that is finitely generated |H12t 

EllEi]. Hence, it can be thought of as the coordinate ring of the variety V/G = spec(i^[l^] ), 
called the categorical quotient [MFKj . We specify V and G succinctly by the pair {m^r) in 
unary. This pair will be the input in our problem. Thus the bit-length of this succinct input 
specification is 0{n). All the space and time bounds for the algorithms with this input will be 
in terms of n. 
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By Noether's Normalization Lemma (Lemma l2.10p , there exists a set S C i^r[y] of poly(n) 
homogeneous invariants such that is integral over the subring generated by S. (This 

statement of Noether's Normalization Lemma is equivalent to the one given in the beginning of 
the introduction. Here a ring R is said to be integral over its subring T if every r € R satisfies a 
monic polynomial equation of the form + 6;_ir'~^ + . . . + bir + 5o = 0, where each 6j G T.) In 
fact, there even exists such an S of optimal cardinality equal to dim(i^[y]'^) < n. It is known 
that any suitably randomly chosen S of this cardinality has the required property. Such an S 
of optimal cardinality is called an h.s.o.p. (homogeneous system of parameters) of -ftr[y]'^. By 
the problem of derandomizing Noether's Normalization Lemma for in short NNL, we 

mean the problem of constructing an S of poly(n) cardinality such that iir[y]'^ is integral over 
the subring generated by S. 

Following |DKj (cf. Section 2.3.2 therein), call a set S C iiTiy]'^ separating if for any u,v V 
such that w{u) 7^ w{v), for some w G there exists an s G 5 such that s{u) / s{v). 

Noether's Normalization Lemma for certain varieties Xj[V,G], 1 < j < m?, associated with V 
and G implies that a small separating 5 of poly(?i) cardinality exists; cf. Proposition 12.121 By 
the problem of derandomizing Noether's Normalization Lemma for in a strong form we 

mean the problem of constructing a separating S of poly(n) cardinality. If S is separating, then 
/^[y]*^ is automatically integral over the subring generated by S; cf. Theorem 2.3.12 in |DKj . 
Thus the construction of a small separating S" is a stronger form of derandomization of Noether's 
Normalization Lemma for V/G. 

This stronger form of NNL lies at the heart of the problem of classifying r-tuples of m x m 
matrices over K under simultaneous conjugation by SLm{K). When r = 1, this classification is 
provided by the Jordan normal form. For r > 2, this is the so-called wild classification problem 
of invariant theory. The phrase "wild" here (meaning "impossible" or "hopeless") has a precise 
mathematical meaning |Dzl IBSel IDW] analogous to the phrase "NP-complete" in complexity 
theory. There is also a result |BSe| analogous to the NP-completeness result which says that 
a solution to the problem of classifying tuples of matrices implies a solution to the problem of 
classifying representations of any quiver. By |MFKj . the points oiV/G = SY>ec{K\y]^ are in 
one-to-one correspondence with the closed G-orbits in V . The points in V with closed orbits 
are called stable jMFKj or semi-simple. If we restrict ourselves to only closed orbits as in 
[MFK] ■ then a coarser classification problem is to (1) explicitly parametrize the closed orbits, 
i.e., associate with each closed orbit an explicit point in a one-to-one (regular) manner so that the 
set of such explicit points is a closed variety embedded in an ambient space of small (poly(n)) 
dimension, and (2) explicitly decompose any semi-simple tuple into simple tuples. By [Km], 
each point v ^ V can be driven to the unique closed orbit in the G-orbit closure of v by an 
optimal one parameter subgroup. One also wants to associate with v the instability flag of 
this optimal one parameter subgroup explicitly. In the mathematics literature, the phrases 
"explicit" and "impossible" are used informally. We give them formal interpretation from the 
complexity-theoretic perspective as follows. 

By explicit, we mean poly(ri)-time computable. Specifically, by [MFK], invariants distinguish 
closed G-orbits in V . Hence a separating 5 distinguishes closed orbits. A small separating S 
yields a one-to-one and onto regular map from the closed orbits of V to the points of a closed 
variety embedded in an ambient space of poly(n) dimension; cf. the proof of Theorem l3.51 By an 
explicit parametrization of closed orbits we mean parametrization by a small separating 5 that 
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can be computed in poly(n) time. The problem of computing such an expUcit small separating 
S is precisely the stronger form of NNL. 

The existing techniques for Noether normalization can be applied to V/G to construct an 
h.s.o.p. for Since the dimension of the ambient space containing V/G is exponential 

in n, these techniques take space that is exponential in n and time that is double exponential; 
cf. Proposition 13. 1[ The existing techniques for Noether normalization can also be applied to 
the varieties Xj[V,Gys to construct a separating homogeneous S of small degree and poly(n) 
cardinality (and even optimal cardinality) deterministically; cf. Proposition 13. 2[ These tech- 
niques also take space that is exponential in n and time that is double exponential in n since 
the dimension of the ambient space containing X[V^',G'] is also exponential in n. We interpret 
the phrase "impossible" as referring to the seemingly impossible task of bringing this double 
exponential time bound down to polynomial. 

This may well be impossible if we insist on optimality, as done quite often in the mathematics 
literature, and ask for an h.s.o.p. or a separating S of optimal cardinality, since it is not even 
known if such optimal objects with subexponential bit-length of specification exist. Furthermore, 
it seems difficult to improve on the current exponential space and double exponential time bound 
with the existing techniques if we want a separating homogeneous S of small degree and optimal 
cardinality. But if we do not insist on optimality and only require S to be small of poly(n) 
cardinality then the following result (Theorem II. ID says that this double exponential time bound 
can be brought down to polynomial assuming black-box derandomization of STIT. Specifically, 
it says that the problem of constructing such an S can then be brought down from EXPSPACE, 
where it was earlier for the reasons above, to PSPACE unconditionally, to PH assuming GRH, 
and even further to P assuming the black-box derandomization hypothesis for STIT. The space 
and time bounds are in terms of the bit-length n of the succinct specification. 

We need a few definitions for stating this result. Call a set S C an s.s.o.p. (small 

system of parameters) for if (1) S contains poly(n) homogeneous invariants of poly(n) de- 

gree, (2) i<r[y]'^ is integral over the subring generated by S, (3) each invariant s = s{Ui, . . . , Ur) 
in S has a weakly skew straight-line program |MPj over Q and the variable entries of ?7j's of 
poly(n) bit-length. Here (3) is equivalent |MP| to (3)': every s G 5 can be expressed as the 
determinant of a matrix of poly(n) size whose entries are (possibly non-homogeneous) linear 
combinations of the entries of f/j's with rational coefficients of poly(n) bit length. By |Csl IMPj . 
it follows that, given such a weakly-skew straight-line program of an invariant s £ S and any 
rational matrices Ai, . . . , Ar G Mm{Q), the value s{Ai, . . . ,Ar) can be computed in time poly- 
nomial in n and the total bit-length of the specifications of Aj's (and even fast in parallel). 
Thus an s.s.o.p. is an approximation to h.s.o.p. that has a small specification and is easy to 
evaluate. Henceforth, we only require that S be small (of poly(n) cardinality) and do not insist 
on optimality. We say that 5 is a separating s.s.o.p. if (2) is replaced by the stronger (2)': S is 
separating. 

Let us call a set S an e.s.o. p. (explicit system of parameters) for if (1) S is an s.s.o.p. 

for and (2) given m and r, the specification of S, consisting of a weakly skew straight-line 

program as above for each s £ S, can be computed in poly(77,) time. Thus the key difference be- 
tween an s.s.o.p. and an e.s.o. p. is that an e.s.o. p. can also be constructed uniformly. We call S 
a separating e.s.o. p. if (1) is replaced by the stronger (1)': iS* is a separating s.s.o.p. A separating 
e.s.o. p. yields an explicit parametrization of the closed G-orbits in V; cf. Theorem 1 1.1 1 (b) (1) be- 
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low. A (separating) quasi-e.s.o.p. is defined by replacing the poly(n) bounds by 0(2P°^y^'-'S(")) 
bounds. A (separating) subexponential-e.s.o.p. with a small exponent 5 > is defined by 
replacing the poly(n) bounds by 0(2*^^" •*) bounds. Quasi-s.s.o.p. and subexponential-s.s.o.p. 
(separating) are defined similarly. 

By the following result, black-box derandomization of STIT implies existence of a separating 
e.s.o.p. and thus explicit coarse classification (parametrization and decomposition) of semi- 
simple tuples and GRH implies a P//-algorithm for the construction of a separating s.s.o.p. See 
|AB| for the precise definitions of the various complexity classes used in this result. 

Theorem 1.1 Let K he an algebraically closed field of characteristic zero. Let V and G he as 
above. 

(a) A separating s.s.o.p. for K[V]'^ exists and can be constructed in poly(n) work space uncon- 
ditionally. 

(h) (1) A separating e.s.o.p. yields a one-to-one and onto explicit (polynomial time computable) 
regular map from the closed orbits of V to the points of a closed variety embedded in an ambient 
space of poly{n) dimension. (2) Assuming the black-box derandomization hypothesis for STIT 
over K, K\y]^ has a separating e.s.o.p. Analogous result holds, after replacing the poly{n) 
bound everywhere by 0{n'~^^^°^'"'^) bound, if we assume black-box derandomization over K of PIT 
for depth four circuits instead of STIT. 

(c) Suppose STIT over K for mxm matrices andr variables has O (2^^) -time- computable hitting 
set, n = rrn^ , for any small constant e > 0. Then K\y]'-^ has a subexponential separating e.s.o.p. 
for any exponent 6 > 0. 

(d) Suppose GRH holds. Then: (1) The problem of constructing a separating s.s.o.p. for K\yY' 
belongs to S3 C PH . (2) It also belongs to c quasi-PH . This means there is a quasi- 
polynomial time algorithm, with an access to the NP^^ -oracle, for constructing a separating 
quasi-s.s.o.p. for K[V]'^ . (3) It also belongs to i.o.M Aquasi-haif-EXP- This means there is an 
M Aquasi-haif-EXP-o-^gorithm for constructing a separating quasi-half- exponential s.s.o.p. (as 
defined in Section^ that works correctly for infinitely many n. 

(e) Given a rational v £ V, the instability flag IKm!^ (/>(f ) of an optimal one-parameter subgroup 
that drives v to a point in the unique closed orbit that lies in the closure of the G-orbit of v can 
be computed in time that is polynomial in n and the bit-length of the specification of v. 

(f) Given a semi-simple rational v G V, the decomposition ofv into simple tuples can be computed 
in time that is polynomial in n and the bit-length of the specification of v. 

Here, and in what follows, the prefix i.o. denotes the infinitely-often version [Klj of the com- 
plexity class. The complexity class M Aquasi-haif-EXP in (d) denotes the quasi-half-exponential 
|MVW| variant of MA, and we think of MA class of functions rather than decision prob- 
lems. It is interesting to know how far GRH can take us here in comparison to the black-box 

^We realized that the key mgredient in the proof of existence of a (separating) s.s.o.p. as in (a), namely that 
V/G = spec[K\VY') is an explicit variety (Lemma 13. 7p . can be used to construct a (separating) s.s.o.p. in poly- 
nomial space unconditionally after the publication of the preliminary abstract |Mu4] . As such, the unconditional 
known space complexity of the problem of constructing an s.s.o.p. for this ring (as well the coordinate rings of 
general explicit varieties) was stated in that abstract as exponential, as it was then to our knowledge. 
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derandomization hypothesis for STIT, since this may give us some idea of the difficulty of the 
black-box derandomization hypothesis. Thus we may ask if NNL for i(r[y]'^ can be put in P 
or even NP (thought as a class of functions) assuming GRH. That is open. Theorem 11.11 (d) 
(2) and (3), the main technical results in this paper, are attempts to go as far as we can on the 
basis of GRH with this in mind. It may be possible to strengthen the proof of (d) (3) to show 
that NNL for is in MA assuming GRH; cf. the remark after Theorem 17.51 But as far as 

we can see on the basis of the proof of (d) (3), GRH may not suffice to put the problem in NP. 

The variety V/G = spec{K[V]'^) here is a basic prototype of the moduli space of repre- 
sentations of a wild quiver |DWj . Theorems 11.11 (a)-(d) can be generalized to moduli spaces of 
representations of any wild quiver. By this generalization (Theorem 14. ip . black-box derandom- 
ization of STIT over an algebraically closed field K of characteristic zero implies derandomization 
of Noether's Normalization Lemma in a strong form, and thereby, explicit parametrization of 
semi-simple representations for any wild quiver and dimension data. Analogous result holds for 
semi-simple representations of any finite dimensional algebra; cf. Theorem 13.141 

The following result is a variant of Theorem 11.11 (b) and (c) assuming, instead of the black-box 
derandomization hypothesis, arithmetic lower bounds, or Boolean lower bounds in conjunction 
with GRH. 

Let nonuniform NC[d{n), S{n)] denote the class of Boolean functions that can be computed 
by nonuniform Boolean circuits of depth d{n) and size S{n), and TC^[S{n)] the class of Boolean 
functions that can be computed by constant-depth Boolean threshold circuits of size S{n). Let 
nonuniform NC^ denote nonuniform A^C[log* n, poly(n)]. 

Theorem 1.2 Let V,G, and K he as in Theorem \L1[ 

(a) Suppose EXP has a multilinear integral function with a subexponential arithmetic lower 
bound. Then K[V]'^ has a separating quasi-e.s.o.p. 

The arithmetic lower bound assumption here is that there exists an exponential-time-computable 
multilinear polynomial f{xi, . . . ,Xr) with integral coefficients of poly (r) bit-length such that f 
cannot be computed by an arithmetic circuit over K of (1) 0(2^") size and 0{r°') depth, for 
some constant a > Q, or alternatively, (2) 0(2*^/" ) size and constant depth for some constant 
a' > 0. 

(b) Suppose EXP has a multilinear integral function with a polynomial arithmetic lower bound. 
Then K[V]'^ has a separating subexponential- e.s.o.p for any exponent 6 > 0. 

The arithmetic lower bound assumption here is that f as in (a) is not in algebraic-NC"^ ; i.e., 
it cannot be computed by an arithmetic circuit over K o/0(log^ r) depth and 0{r"') size for any 
constant a > 0. 

(c) Suppose EXP has a subexponential Boolean lower hound and that GRH holds. Then K\VY' 
has a separating quasi-e.s.o.p. 

The Boolean lower hound assumption here is that EXP % i.o. nonuniform- N C [n'' ,2^^], for 
some constant e > 0, or alternatively, that EXP ^ z.o.TC'^[2"'/" ], for some constant a' > 0. 
The prefix i.o. can he removed from the assumptions and added to the conclusions. 

(d) Suppose EXP has a polynomial Boolean lower hound and that GRH holds. Then K\y]'~^ has 
a separating subexponential- e.s.o.p. for any exponent 5 > 0. 
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The Boolean lower bound assumption here is that (1) EXP ^ i.o. nonuniform-NC'^ , or 
alternatively, that (2) EXP g io.TCO[n<^(v^^°s")] . The prefix i.o. can he removed from the 
assumptions and added to the conclusions. 

(e) Suppose EXP ^ MA and that GRH holds. Then there is a subexponential time algorithm 
for constructing a separating subexponential- s.s.o. p. for K[V]'^ , for any exponent 6 > 0, that is 
correct for infinitely many n. 

(f) The results in (a)-(e) also hold, replacing EXP by EXP^^ in the assumptions and giving 
the algorithm for constructing S an access to the N P-oracle. The uniform lower hound assump- 
tion in (e) is replaced in this case by the uniform assumption that EXP^^ ^ Eg" H , or 
alternatively, that NEXP ^ MA. 

If we fix a conjecturally hard function in EXP, such as the permanent, then the proof of 
Theorem 11.21 (a), in conjunction with the arithmetic NW-generator |NW1 [Kl] . yields a concrete 
conjecturahy correct separating quasi-e.s.o.p., and thereby, a conjecturally correct quasi-explicit 
parametrization of semi-simple tuples of matrices, and more generally, semi-simple representa- 
tions of any finite dimensional algebra; cf. Theorem 15.31 

1.2 The general ring of invariants 

Next we turn to the general setting when V is any rational representation of G = SLm{K) of 
dimension n. 

Since G is reductive [Fuj , V can be decomposed as a direct sum of irreducible representations 



Here A : Ai > . . . > A;, / < m, is a partition, i.e., a sequence of non-negative integers, and 
Vx{G) is the irreducible representation of G (Weyl module |Fuj ) labelled by A. We specify V 
and G succinctly by giving n and m (in unary) and the multiplicities m(A)'s (in unary) for all 
A's that occur with nonzero multiplicity in this decomposition. The degree d of y is defined 
to be max{|A| = X^Aj}, where A ranges over such partitions. Fix the standard monomial 
basis [DRSl ILRj for each V\{G) and thus a standard monomial basis for V . Let vi,. . . ,Vn be 
the coordinates of V for this basis. This fixes the action of G on V . Let K\y] = K[vi, . . . , f„] 
denote the coordinate ring of V . Let be its subring of G-invariants. We call a polynomial 

f{v) G K\y\ a G-invariant if f{a^^v) = f{v) for all a £ G. By NNL in this context, we 
mean the problem of deterministically constructing an S of poly(n) cardinality such that 
is integral over the subring generated by S. The bit-length of the succinct specification of 
V/G = spec{K[V]^), consisting of n and m in unary and m(A)'s in unary, is 0{n + m). The 
existing techniques for Noether normalization applied to V/G take space that is exponential 
in the bit-length 0(n + m) of the succinct specification and time that is double exponential 
even if m is constant; cf. Section [2.41 and the remarks after Theorem 11.41 The following result 
(Theorem II. 3p says that this double exponential time bound can be brought down to quasi- 
polynomial unconditionally for constant m. 

We say that S Q K[V]'~^ is an s.s.o. p. (small system of parameters) for if (1) 

is integral over the subring generated by S, (2) the cardinality of S is poly(n,m), (3) every 



of G: 




(2) 



A 
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invariant in S is homogeneous of poly(n, m) degree, and (4) every s £ S has a small specification 
in the form of a weakly skew straight-line program [MPj of poly(n, m) bit-length over Q and the 
coordinates vi, . . . ,Vn oi V . Here (4) is equivalent |MPj to (4)': every s G 5 can be expressed 
as the determinant of a matrix of poly(n, m) size whose entries are (possibly non-homogeneous) 
linear combinations of fi, . . . , f„ with rational coefficients of poly(n, m) bit length. A separating 
s.s.o.p. is defined by replacing (1) with the stronger (1)': S is separating |DKj . We define a 
(separating) s.s.o.p. in a relaxed sense by dropping the weakly skew requirement in (4) and the 
degree requirement in (3). 

We call a subset S C /^[l/]'-' an e.s.o.p. (explicit system of parameters) for if (1) S 

is an s.s.o.p. for A'[y]'^, and (2) the specification of S, consisting of a weakly skew straight- 
line program as above for each s G S, can be computed in poly(n, m) time, given n, m, and 
the nonzero multiplicities m(A)'s of V\{Gys as in eq.([2]). A separating e.s.o.p. is defined by 
replacing (1) with the stronger: (1) 5 is a separating s.s.o.p. for A (separating) e.s.o.p. 

in a relaxed sense is defined by requiring to be a (separating) s.s.o.p. in a relaxed sense only. 
A (separating) quasi-e.s.o.p. (in a relaxed sense) is defined similarly. 

If G has constant dimension, then Noether's Normalization Lemma for can be quasi- 

derandomized in a strong form unconditionally, and parametrization of closed orbits can also be 
done explicitly: 

Theorem 1.3 Suppose K is an algebraically closed field of characteristic zero. Let V as in ^ 
be a rational representation of G = SLm{K) of dimension n. Suppose m is constant, or more 
generally, 0{polylog{n)) (as is the case if m = 0((\fd); cf. Lemma \8.11l\) . Then K\V\'~' has a 
separating quasi-e.s.o.p. Furthermore, such a quasi-e.s.o.p. can be constructed by a uniform 
circuit of quasi-polynomial size with oracle access to DET . 

Here DET denotes the determinant function. 

For general m we have the following result which generalizes Theorem 11.11 (a)-(d) (1) to 
any finite dimensional rational representation V oi G assuming the black-box derandomization 
hypothesis for general PIT (instead of STIT) and an additional hypothesis of about explicitness. 

Theorem 1.4 Suppose K is an algebraically closed field of characteristic zero. Let V as in 
^ be a rational representation of G = SLm{K) of dimension n. Suppose the variety V/G = 
spec{K\y]'~^) is explicit in a relaxed sense (cf. Definition \8.9\) . Then: 

(a) A separating s.s.o.p. for K[V]'^ exists in a relaxed sense and can be constructed in poly{n) 
work-space. 

(b) Suppose the black-box derandomization hypothesis for PIT over K holds. Then K[V]'^ has 
a separating e.s.o.p. in a relaxed sense. 

(c) Analogue of Theorem \l.l\ (c) holds in this context. 

(d) Analogue of Theorem \l.l\ (d) (1) also holds. 

This result also holds for any finite dimensional representation of a classical simple algebraic 
group in characteristic zero; cf. Theorem 19.11 
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The general ring K[V]^ in Theorem O is a basic prototype of the ring that arises in the 
context of the fundamental classification (moduli) problem [MFKj of algebraic geometry. Its 
finite generation is the celebrated result of Hilbert |H12j . Before this result it was not even 
known if a finite S, let alone a small S, such that is integral over the subring generated 

by S", exists. Hilbert's first proof of finite generation was nonconstructive. This was severely 
criticized by Gordan as "theology and not mathematics". Noether's Normalization Lemma as 
well as the Nullstellansatz were proved by Hilbert in the course of his second constructive proof 
of this result in response to this criticism. It is fair to say that this constructive proof and 
its various mathematical ingredients such as the Normalization Lemma and the Nullstellansatz 
changed the course of algebraic geometry, representation theory, and the theory of computation 
in the twentieth century. But Hilbert could only show that his algorithm for constructing finitely 
many generators for worked in finite time. He could not prove any explicit upper bound 

on its running time. Such a bound was proved in [P] a century later, and improved significantly 
in [D]. This improved analysis, in conjunction with a recent fundamental advance [MR2] in 
Grobner basis theory, yields an EXPSPACE-algorithm to compute a small S such that i^Tfy]'^ is 
integral over the subring generated by S; cf. Proposition 18.21 This algorithm needs exponential 
space and double exponential time even when ra is constant, because the dimension of the 
ambient space containing V/G is exponential in n even when m is constant; cf. Section [2.41 and 
the remark after Proposition 18. 2[ 

Theorem 11.31 shows that this double exponential time bound can be brought down to quasi- 
polynomial, thereby bringing NNL (even in the strong form) from EXPSPACE to quasi-P (in 
fact, quasi- -DET) unconditionally if m is constant or 0(polylog(n)). Classical invariant theory 
mainly focused on the case when m is constant. For example, Hilbert's paper |H12j mainly 
focused on the case when m = 4. The problem of constructing a finite set of generators for 
was not known to be decidable before this paper even in this case. Thus Theorem 11.31 
does put NNL in quasi-DET unconditionally in the case of that Hilbert's paper focused on. 

Theorem 11.41 says that NNL (in a strong form) can be brought down from EXPSPACE^ 
where it is currently, to P for any m assuming black-box derandomization of PIT and also 
assuming that the variety V/G = spec(iC[y]^) is explicit in a relaxed sense (Conjecture I8.10p 
as per the general definition of an explicit variety (Definition 110. 2p introduced in this article. 

1.3 Equivalence 

A large class of algebraic varieties that are studied in algebraic geometry are explicit. For 
arbitrary explicit varieties we have the following result. 

Let the base field K be an algebraically closed field of characteristic zero. 

Theorem 1.5 For any explicit variety specified succinctly ( Definition 1 1 0. 2\) . the problem of de- 
randomizing Noether's Normalization Lemma over K is (1) in PSPACE unconditionally, (2) in 
S3 C PH assuming GRH, and (3) in P assuming a strengthened form of black-box derandom- 
ization of PIT (defined in Section \10.1\) . 

The following result says that this strengthened form of black-box derandomization of PIT 
is, in fact, equivalent to derandomization of Noether's Normalization Lemma in a strict form 
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for a specific explicit variety as follows. 
Theorem 1.6 (Equivalence) 

(1) A strengthened form (defined in Section \10.1\) of black-box derandomization of symbolic de- 
terminant or trace identity testing (SDIT or STIT) is equivalent to derandomization of Noether's 
Normalization Lemma (in a certain strict form) for the explicit algebraic variety A[det,m] asso- 
ciated with the determinant in JMSl^ in the context of the permanent vs. determinant problem. 

(2) A strengthened form of black-box derandomization of general PIT over K is equivalent to 
derandomization of Noether's Normalization Lemma (in a strict form) for the explicit algebraic 
variety similarly associated with the complexity class P in \MS1^ in the context of the algebraic 
P vs. NP problem. 

(3) A strengthened form of black-box derandomization of PIT for depth three circuits over K 
is equivalent to derandomization of Noether's Normalization Lemma (in a strict form) for the 
(polynomially high order) secant variety of the Chow variety. 

See Section [10.41 for the full statements of these results. The explicit varieties occurring in 
this result are harder then the categorical quotient V/G in Theorem 11.11 in the following sense. 

First, the variety V/G is normal and has rational singularities |Btj (even in the general 
setting of Theorem II. 4p . whereas A[det,?TT.] is not even normal |Kuj . 

Second, explicit defining equations are known for V/G in Theorem 11.11 They are given 
by the second fundamental theorem (SFT) for matrix invariants [Prl \Rz\ IFoj . (For this rea- 
son, the problem of constructing an e.s.o.p. for in Theorem II T I without requiring the 
separation property is expected to be easier.) Such defining (or close □ to defining) equations 
play a crucial role in the known cases of derandomization of Noether's Normalization Lemma 
that lead to black-box derandomization of nontrivial subclasses of PIT. For example, the ex- 
plicit determinantal equations |L1] of the secant variety of the Veronese variety are crucial 
for derandomization of Noether's Normalization Lemma for this variety and black-box (quasi) 
derandomization of diagonal depth three circuits. They are implicitly used in all existing tech- 
niques for black-box derandomization of diagonal depth three circuits, such as |SVj and |ASSj . 
The explicit determinantal equations of the Grassmanian, G/P, and Schubert varieties are cru- 
cial for derandomization of Noether's Normalization Lemma for these varieties and black-box 
derandomization of the Grassmanian SDIT; cf. Section 111.21 They are implicitly used in all 
existing techniques for black-box derandomization of the Grassmanian SDIT, such as the one in 
Section dLU here or [FS] . 

But for the varieties in Theorem 11.61 finding explicit defining equations is a huge challenge. 
It is intimately related to the century-old plethysm and Kronecker problems of invariant theory; 
cf . |MS2[ ILlj . Finding explicit defining equations is also extremely hard for the explicit varieties 
that arise in the construction of a separating e.s.o.p. in Theorem 11.11 (b) assuming black-box 
derandomization; cf. Example (1) in Section [10.2.11 For V/G in Theorem 11.41 the problem of 
finding explicit defining equations, also called the problem of proving the second fundamental 
theorem (SFT) in general, is considered "completely unsolvable" (cf. page 238 in |P Vj ) . Thus 

^Strictly speaking, we do not need the equations to be defining in the context of NNL. We only need that they 
cut out a variety of polynomial dimension containing the variety under consideration. 
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a common feature of all wild problems considered in this paper is that the problem of finding 
explicit defining (or close to defining) equations for the underlying varieties turns out to be 
extremely hard. This seems to be the common cause of their wildness. 

I. 4 The GCT chasm 

By Theorem 11.51 NNL for explicit varieties is in PH assuming GRH. Thus currently the main 
obstacle to putting NNL for general explicit varieties in PH unconditionally is GRH. Removing 
this obstacle already seems to be a huge challenge. As already mentioned after Theorem 11.11 
GRH may suffice to put NNL in MA but possibly not NP. Bringing NNL from PSPACE all 
the way to P, where the black-box derandomization hypothesis takes us (cf. Theorems 11.11 11.51 
and II. 6p , may be much harder and well beyond the reach of the existing techniques even for 
depth three circuits. 

Theorems 11.11 11.21 and 11.61 may thus explain why proving black-box derandomization results 
for PIT for even depth three circuits, proving super polynomial constant-depth-arithmetic-circuit 
or Boolean-TC''-circuit lower bounds for even EXP^^ , or proving even very conservative uni- 
form Boolean conjectures such as EXP^^ ^ S^' ^ ^2 turned out to be so hard. In con- 
trast, there are already known derandomization results for several versions of restricted PIT's 
and restricted classes of depth-three circuits in characteristic zero (cf. the survey |SY| and 
the references therein), a quadratic lower bound for depth three circuits in characteristic zero 
|SWj . a quadratic determinantal lower bound for the permanent |MRj . superpolynomial AC^ 
lower bounds for parity and majority (cf. |BSij and the references therein), a superpolynomial 
ACC lower bound for EXP^^ [Wij . and uniform hierarchy theorems such as EXP^P % A^, 
along with the known barriers [BGSl IRRl lAWj to the proof techniques of some of these lower 
bounds. Since non-black-box derandomization of PIT is closely related to circuit lower bounds 
for NEXP jKIj . Theorem 11.21 (f) may explain why even partial non-black-box derandomization 
of PIT has turned out to be so hard. Theorems II. H [T~4l 11.51 and 11.61 may explain the difficulty 
of the classification problem in algebraic geometry from the complexity-theoretic perspective. 

The results in this paper thus reveal a chasm, in the terminology of |AVj . at depth three 
arithmetic circuits, at TC^ Boolean circuits, and at uniform conjectures such as EXP^^ ^ 
n H^. This chasm also extends to geometry (classification). As per Theorems II. H II. 2^ 

II. 41 and 11.61 the root cause of this chasm common to geometry and complexity theory lies at 
the junction of these two fields, namely, the problem of derandomizing Noether's Normalization 
Lemma. We refer to the difficulty of derandomizing Noether's Normalization Lemma, or formally 
the existing PSPACE (or rather | EXP SPACE) vs. P gap in the current knowledge, as the 
GCT chasm. The prefix GCT here refers to the location of the chasm at the junction of geometry 
and complexity theory. It does not mean that either geometry or complexity theory is necessary 
to cross it. Though both fields may turn out to be indispensable in practice. 

We conjecture that (separating) e.s.o.p.'s exist for the iir[y]*^'s under consideration and the 
coordinate rings of explicit varieties in general: i.e., Noether's Normalization Lemma for these 
rings can be derandomized (in a strong form) , as suggested by Theorems II. Ij 11.21 11-41 11.51 and 

^This is because strict NNL for V/G in Theorem II. 41 which is currently in EXPSPACE for general m, is easier 
than (i.e. can be reduced to) the strict NNL for the explicit variety in Theorem 11.61 (21 assuming that V/G for 
general m is explicit in a relaxed sense (Conjecture 18. 10) . 
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11.61 cf. Conjecture lll.il and the GCT chasm can be crossed. 

By geometric complexity theory (GCT), we henceforth mean broadly any approach to cross 
this chasm based on a synthesis of geometry and complexity theory in some form. One plausible 
approach is suggested in this article (Section [TT]) and its sequel |Mu5j based on the constructions 
in the earlier papers pST| lMS2l [MNS| IBMS] in this series. 

1.5 Proof technique 

The proof of Theorem 11.11 (a), (b), and (c) is based on the fundamental works in geometric 
invariant theory and algebraic complexity theory, specifically, the first and second fundamental 
theorems (FFT and SFT) for matrix invariants in \Pr:\ IRzj . the properties of the categorical 
quotient V/G = spec{K[V]^) proved in [MFKl iDKj . and the existence of a hitting set for PIT 
proved in |HSj . The key idea in the proof of Theorem 11.11 (a) is to show, using these FFT 
and SFT, that the categorical quotient V/G = spec{K[V]^) that arises therein is an explicit 
variety (cf. Lemma [3. 7p . Specifically, we show that, given m and r, one can construct in poly(n) 
time symbolic trace polynomials Fj{v,y), 1 < j < m^, over K, where v = {vi, . . . ,Vn) are 
the coordinates of V and y = (yi,...,y„) are auxiliary variables, such that Fj{v,yys can be 
expressed as linear combinations of linearly independent symbolic traces over y with coefficients 
in Q[v] that generate Specializing the y-variables of Fj{v,yys at the elements of any 

hitting set T for STIT, which exists by |HSj . yields a small separating set S C of invariants 

with small specifications in the form of symbolic traces. It follows, by |MFK1 IDKj . that S is 
a separating s.s.o.p.. This proof can be made constructive so as to construct an S in poly(n) 
work-space using the PSPACE-algorithm \Ko\ IKolj for Hilbert's Nullstellansatz. This proves 
Theorem ll.il (a). The proof of Theorem ll.il (b) (1) is based on the properties of the categorical 
quotient V/ G proved in [MFK^ IDKj . If the hitting set T in the proof of Theorem 11.11 (a) is 
chosen explicitly, as can be done if the black-box derandomization hypothesis for STIT holds, 
then S can be constructed in poly(n) time. In this case 5 is a separating e.s.o.p. This proves 
Theorem ll.il (b) (2). The proof of Theorem ll.il (c) is a variation of this idea. Theorem ll.il (d) 
(1) follows if one uses the fundamental PH-algorithm for Hilbert's Nullstellansatz in [Kolj in 
place of the PSPACE-algorithm used in the proof of Theorem ll.il (a). 

Theorem ll.il (e) and (f) is based on the characterization |Arj of the closed G-orbits in V and 
the polynomial time algorithms |FR1 lEbj for computing the decomposition of a finite-dimensional 
algebra. 

Theorem 11.21 (a) and (b) are reduced to Theorem 11.11 (b) and (c), respectively, using the 
fundamental equivalence between black-box derandomization of PIT and circuit lower bounds for 
EXP \H.S\ \IW\ \KI\ |Ag] , efficient factorization of multivariate polynomials given by straight-line 
programs |Klj , and the depth reduction for arithmetic circuits [VSBRl lAVt IKo2j . Theorem 11.21 
(c) and (d) are reduced to Theorem 11.21 (a) and (b) using the fundamental connection based 
on GRH between Boolean and algebraic complexities [Bui IKolj . and the TC^-algorithms for 
division and iterated multiplication jHABj . Theorem 11.21 (e) is reduced to Theorem 11.21 (d) 
using the the fundamental connection between nonuniform and uniform Boolean lower bounds 
pCLllBENWlUKW] . Theorem O (f) is obtained by extending the proof of Theorem O (a)-(e), 
plugging the A^P-oracle in the right places. 

A similar extension of the proof of Theorem 11.21 (c) , plugging the iVP^^-oracle in the right 



14 



places, yields a quasi-e.s.o.p. S for /^[y] giving the algorithm for constructing S an access to 
the NP^^ oracle, and assuming (i) GRH, and (ii) a subexponential circuit size lower bound 
for A^^^. But (ii) holds unconditionally by the fundamental subexponential circuit size lower 
bound for Af^-^ [Knl IMVW| . This implies that S is a quasi-e.s.o.p., with an access to the 
NP^^ oracle, assuming GRH alone. This proves Theorem 11.11 (d) (2). To prove Theorem 11.11 
(d) (3), one uses the fundamental circuit lower bound for MAexp |MVWj instead in the last 
step. 

The proof of Theorem 11.41 is obtained by generalizing the proof technique of Theorems 11.11 
and ll.21 in conjunction with the algorithm in |H12| for constructing finitely many generators for 
computational invariant theory |Stm21 ID]. the bounds for the degrees of the generators 
for in [P1[D], and standard monomial theory [LR^ IDRS] . 

The proof of Theorem 11.31 differs from the proof of Theorem ll.4l in two places. First, explicit- 
ness oiV/G = s'^ec{K\V]'-') for constant m can be proved unconditionally (Theorem 18. 8p using 
properties of the Reynolds operator |DKt IStin2] . standard monomial theory |LR1 IDRS] . and al- 
gebraic complexity theory \Cs\ IMP[ IStrH [Str2j . This is the heart of the proof. Second, one only 
needs the black-box (quasi) derandomization hypothesis for diagonal depth three circuits |Sxj 
in this case. This hypothesis is a consequence of the proof technique in |SVj (cf. Theorem 12. 4p . 
and is also proved in |ASSj . Hence quasi-derandomization of Noether's Normalization Lemma 
follows unconditionally when m is constant or 0(polylog(n)). This proves Theorem 11.31 

Theorems 11.51 and 11.61 are proved by abstracting the ideas in the proofs of Theorems 11.11 11.31 
andO 

1.6 Organization 

The rest of this article is organized as follows. 

In Section [2] we recall some results and notions in complexity theory and geometric invariant 
theory that we need in this paper. Theorem ll.il (a), (b), (c), (d) (1), (e), and (f) are proved in 
Section [3l These results (except for (e) and (f)) are generalized to arbitrary quivers in Section [H 
Theorem 11.21 (a), (b), and a half of (f) are proved in Section [5l Theorem 11.21 (c)-(d), and the 
other half of (f) are proved in Section [6l All the ingredients in the preceding sections are put 
together to prove Theorem 11.11 (d) (2) and (3) in Section [71 Theorems 11.31 and 11.41 are proved 
in Section [HI Generalization of Theorems 11.31 and 11.41 to classical simple algebraic groups are 
proved in Section [9j Theorems 1 1 . 5 1 and 1 1 . 6 1 are proved in Section [TOl One plausible approach to 
derandomization of PIT based on the equivalence in this article is formulated in Section [TTl 

Acknowledgment: In the first draft of this paper, Theorem 11.31 was stated and proved condi- 
tionally, assuming the black-box derandomization hypothesis for diagonal depth three circuits 
[Sxj . The author is grateful to Amir Shpilka and Michael Forbes for pointing out that this 
hypothesis (allowing a quasi prefix) is a consequence (cf. Theorem 12. 4p of the fundamental 
black-box derandomization technique in |SVj for the read-once PIT and that it is also proved in 
[ASSj . Hence, Theorem 11.31 holds unconditionally. The author is also grateful to Sanjeev Arora, 
Jonah Blasiak, Josh Grochow, and Joseph Landsberg for helpful comments. 
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2 Preliminaries 



In this section we recall some results and notions in complexity theory and geometric invariant 
theory that we need in this paper. We assume familiarity with basic complexity theory |ABj . 
algebraic geometry [Mm] , and representation theory |Fu| . 



2.1 Black-box derandomization hypothesis 

We begin by stating in more detail the black-box derandomization hypothesis for polynomial 
identity testing [HSl HWl KH IDSY] that we need. 



The polynomial identity testing (PIT) problem over a field K is the problem of deciding if a 
given arithmetic circuit C{x), x = {xi, . . . , Xr), over K of size at most s computes an identically 
zero polynomial. In this paper, the size of the circuit means the total number of edges in it. 
There is no restriction on the bit-lengths of the constants in the circuit. We also assume, unless 
stated otherwise, that K is an algebraically closed field of characteristic zero. By the PIT 
problem of small degree, we mean the PIT problem wherein the polynomial computed by the 
circuit is assumed to have a small 0{s^) degree, for some fixed constant c > 0. The article |IMj 
gives a randomized algorithm to solve the PIT problem in poly(s) operations over K. This is a 
black-box algorithm in the sense that it does not look inside the circuit. It merely evaluates the 
circuit at randomly chosen test inputs. 

The black-box derandomization problem for PIT |HS1 IIWl IKH |Ag| is to design an efhcient 
deterministic black-box algorithm for solving the PIT problem. Specifically, the problem is to 
construct efficiently a hitting set against all circuits over K with size < s and on r < s variables. 
By a hitting set, we mean a set Sr,s ^ of test inputs such that (1) the bit-length of the 
specification of each test input is poly(s), and (2) for every circuit C on K and r variables with 
size < s computing a non-zero polynomial C{x), Sr,s contains a test input b such that C{b) / 0. 
The black-box- derandomization hypothesis |HSl lIWi IKI^ |Ag[ IDSYj in this context is that there 
exists a hitting set of poly(s) total bit-size that is computable in poly(s) time. More generally, 
if a hitting set has 0{T{s)) size and is computable in 0{T{s)) time, we say that PIT for circuits 
has T(s)-time-computable black-box derandomization. By our definition of a hitting set, it is 
still required here that the bit-length of each test input in the hitting set be poly(s). 

We have also defined a restricted form of PIT called symbolic trace identity testing (STIT) 
in Section n. II It is equivalent [Bel \M.P\ Sp , up to polynomial factors, to symbolic determinant 



identity testing (SDIT) defined as follows. Let y be a variable m x m matrix. Let Y' be any 
m X m matrix, whose each entry is a homogeneous linear form over K in the variable entries 
Hij^s of Y. We call det(y) a symbolic determinant of size m. By SDIT, we mean the problem 
of deciding, given Y' , if the symbolic determinant det(y') is an identically zero polynomial in 
yi/s. The black-box derandomization hypothesis for SDIT is that, given m, one can construct 
in poly(m) time a hitting set against all non-zero symbolic determinants over K of size m. This 
is equivalent to the black-box derandomization hypothesis for STIT in Section 11.11 The parallel 
black-box derandomization hypothesis for SDIT is that a hitting set is computable by a uniform 
AC^ circuit of poly(m) bit-size with oracle access to DET (the determinant function). The 
equivalent parallel black-box derandomization hypothesis for STIT is that a hitting set against 
symbolic traces (of degree < m) over K for m x m matrices and r variables is computable by a 
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uniform AC^ circuit of poly(m, r) bit-size with oracle access to DET. 
The following result says that a hitting set for PIT exists. 

Theorem 2.1 (Heintz,Schnorr) (cf. Theorem 4-4 ISBl) Let K he any field of character- 
istic zero. There exists a hitting set B C [uY , u = 2s{d + 1)^, of size 6(s + 1 + r)^ against all 
arithmetic circuits over K and r variables of size < s and degree < d. 



The proof of this result in |HSj does not yield any efficient algorithm for constructing B. 

The following result is a variant of Theorem 7.7 in |KI] . This is why PIT is expected to have 
efficient black-box derandomization. 

Theorem 2.2 (Kabanets and Impagliazzo) (cf. Theorem 7.7 in IKlp Suppose K is afield 
of characteristic zero. 

(a) Suppose there is an exponential-time computable multilinear polynomial f{xi, . . . ,Xm) with 
integral coefficients of poly{m) bit-length such that f can not be evaluated by an arithmetic circuit 
over K 0/0(2™") size for some constant a > 0. Then PIT for small degree circuits of size < s 
has O{2P0ly^'>S{s)) 

-time computable black-box derandomization. 

(b) If f cannot be evaluated by an arithmetic circuit over K of 0{m"') size for any constant 
a > 0, m ^ oo, then PIT for small degree circuits of size < s has 0(2^^)-time computable 
black-box derandomization, for any e > 0. 

The proof of this result is very similar to that of Theorem 7.7 in |KI| (which works in the 
black-box model). Hence we omit its details and only point out how to take care of the main 
difference between the setting in |KIj and the one here. The difference is that in |KIj the size 
of the circuit is defined to be the total number of edges in it plus the total bit-length of the 
constants in it, whereas here the size just means the total number of edges. A key ingredient 
in the proof in |KI] is an efficient algorithm in |Klj for factoring multivariate polynomials (cf. 
Lemma 7.6. in |KIj ) . In its place we use instead the following result in [K H IKT| that does not 
depend on the bit-lengths of the constants in the circuit. 

Theorem 2.3 (Kaltofen) (cf. Corollary 6.2. in JK^ and Theorem 1 in \KI^ ) 

Let K be a field of characteristic zero. Suppose g{xi, . . . , x.„) is p-computable \Vlf over K . 
This means g is a polynomial of poly{n) degree that can be computed by a nonuniform circuit 
over K of poly{n) size. Then each factor of g in K[xi, . . . is also p-computable over K. 

More generally, given any polynomial g G -/^[x^^, . . . , x^] and a polynomial f G K\x\^ • • • 5 ^n] 
dividing g, there exists a nonuniform circuit over K of poly{n,deg{g)) size, with oracle gates for 
g, that computes f. 



For the converse of Theorem 12.21 see |HSt |Ag[ ISW| 



For the proof of Theorem 11.31 we will need a restricted form of PIT for diagonal depth three 
circuits [Sx]. By a diagonal depth three circuit, we mean a circuit C(x), x = (xi, . . . ,Xr), that 
computes a sum of powers of linear functions: 

C{x) = Y,l?, 
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where each /j is a possibly non-homogeneous hnear form in Xj's with coefficients in K. Here k 
is called the top fan-in of the circuit, and e = maxjej} its degree. The size of this circuit is 
s = 0{rek). 

The black-box derandomization hypothesis in this context is that a hitting set against diag- 
onal depth three circuits on r variables with degree < e and top fan-in < k can be computed in 
poly(s) time. The parallel black-box derandomization hypothesis is that such a hitting set can 
be computed by a uniform AC^ circuit of poly(s) bit-size. This holds unconditionally allowing 
a quasi-prefix: 

Theorem 2.4 (Shpilka, Volkovich; Agrawal, Saha, Saxena) (cf. Theorem 6.4 in /»S'V| / and 

appendix in jASS^ ) A hitting set against all diagonal depth three circuits over K of size < s can 
be constructed by a uniform AC^ circuit of 0{s^^^°^^^) size. 

This result is a variant of Theorem 6.4 in |SVj . with essentially the same proof |Sp| . Specif- 
ically, the partial derivative method in [ SV] (also cf. |RShj ) implies that if 2' > ke then no 
diagonal depth three circuit over xi,. . . ,Xr with degree < e and top fan-in < k can compute a 
polynomial of the form • • • XiJ{xi, . . . , Xr), for distinct ij's. This lower bound, in conjunction 
with the proof technique of Theorem 6.4 in jSVj . implies that the set of r- vectors with entries 
in 0, . . . , e that contain at most / nonzero entries is a hitting set against diagonal depth three 
circuits over xi,...,Xr with degree < e and top fan-in < k. 

2.2 Geometric invariant theory 

We now state some results from geometric invariant theory and their consequences that are 
needed for proving Theorem ll.il 

Let K be an algebraically closed field of characteristic zero, MmiK) the space of m x m 
matrices over K, and V = Mm.{KY , the direct sum of r copies of Mm{K)- Let n = d\m{y) = 
rm?'. The space V has the adjoint action of G = SLm{K): 

(^1, . . . , Ar) ^ {PAiP-\ . . . , PArP-^), (3) 

where Ai, . . . ,Ar G Mm{K) and P € SLm{K). Let Ui, . . . ,Ur be variable m x m matrices. 
Then the coordinate ring K[V] of V can be identified with the ring K[Ui, . . . , Ur] generated by 
the variable entries of Ui^s. Let C K[V] be the ring of invariants with respect to the 

adjoint action. 

Theorem 2.5 (Procesi-Razmyslov-Formanek) IPA \R4 I-Foj / (The First Fundamental The- 
orem (FFT) of matrix invariants; cf. Theorems 6 and 10 in ]Fo^ ) The ring K\V]'~^ is generated 
by traces of the form trace{Ui-^ • • • U^), I < nn? , ii, . . . ,i/ G [r] = {1, . . . ,r}. 

Furthermore, K\V]'~^ is not generated by traces of degree < (with r = m? ). 

Let i^[S'r] be the group algebra of the symmetric group Sr on r letters. Write any a ^ Sr as 
a product of disjoint cycles: 

a = (ai---afcj(6i---6fc2).... 
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where 1-cycles are included, so that each of the numbers 1, . . . ,r occurs exactly once. Define 

T^iUi,..., Ur) = T{Ua, ■ ■ ■ Ua,^ )T{U,, ' ' ' U,^^) ■ ■ ■ . (4) 

Theorem 2.6 (Procesi-Razmyslov) lPr\ \R3^ (The Second Fundamental Theorem (SFT) of 
matrix invariants; cf. Theorem 1 in ]Fo^) Let J{m, r) be the two-sided ideal of KSr which is 
the sum of all simple factors of KSr corresponding to the Young diagrams with > m + 1 rows. 
Define the K-linear map 4> : KSr — ^ by 

4>C^aa(T) = ^a^r^([/i, . ..,Ur). 

Then Ker{(j)) = J(m,r). Furthermore, J{m,r) =0 if r <m. 

Let X\, . . . , Xr he k X k variable matrices. For any word a = ii, . . . ,ii, I < k, ij G [r] = 
{1, . . . , r}, let 

r«(X) =trace(Xi, •••XiJ, (5) 

where X = (Xi, . . . ,Xr). We say that two words a and a' are equivalent, if a' can be obtained 
from a by circular rotation. In this case, Ta{X) = Ta'{X). Let [a] denote the equivalence class 
of words equivalent to a. Let T[a]{X) = Ta{X); the choice of a in [a] does not matter. 

Corollary 2.7 The traces {T[q,](X)}, where [a] ranges over all equivalence classes of words of 
length I < k, are linearly independent. 

Proof: Suppose to the contrary that there is a linear dependence 

Y,ha\T[a\{X) = ^, bia](^K. (6) 

[a] 

Without loss of generality, we can assume that this relation is homogeneous in Xj's. We can 
also assume that it is multilinear in Xj's. Otherwise, we can multilinearize it by substituting 

di 

in the l.h.s. of (0) and equating the coefficient of Y\^=i ^{ zero, where di is the degree of 

Xi in the relation, t^'s are new variables, and Xl^s are new variable k x k matrices. 

If the dependence is multilinear and homogeneous, then each [a] = [ii, . . . ,ii] corresponds 
to the permutation in Si with just one cycle {ii, . . . ,ii). We denote it by [a] again. Applying 
Theorem 12.61 with U = X, r = I, and m = A:, it follows that 6[a]'s are all zero, since J{k, I) = 
for I < k. Q.E.D. 

The following is an alternative form of Theorem 12.61 

The Cayley-Hamilton theorem implies |Prj the fundamental trace identity 
F{Ui, Um+i) = S^es„+iSign(cj)r^(C/i, . . . , Um+i) = 0, 
where the trace function To- is defined as in 
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Theorem 2.8 (Procesi-Razmyslov) (cf. Theorem 4-5 in IPrf ) The ideal of all relations 
among the trace monomial generators of K[V]^ given by Theorem \2.5\ is generated by the ele- 
ments of the form F(Mi, . . . , Mj„+i), where Mj 's range over all possible monomials in Uj 's so 
that the total length of Mi 's is < m? . 



This follows from the proof of Theorem 4.5 in [Pr] . 

We state the next results in more generality than needed for the proof of Theorem 11.11 since 
we need them in this generality for the proofs of Theorems 11.31 11-41 a^^id 19. 1[ 

So assume now that G is any reductive algebraic group defined over an algebraically closed 
field K of characteristic zero and V its any finite dimensional rational representation. Then 
the invariant ring K[V]^ is finitely generated [Hl2] . So we can consider the variety V/G = 
spec(i('[y]'^), called the categorical quotient [MFKj . It has the following property (Theorem l2.9p 
that plays a crucial role in the proof of Theorem II .11 as well as other results (Theorems 11.31 and 
II. 4p in this paper. 

Fix any set F = {/i, . . . , /^j of non-constant homogeneous generators of Consider 
the morphism TTyiQ from V to given by 

■^v/G- V ^ {fi{v),...,ft{v)). (7) 

Then V/G can be identified with the closure of the image of this morphism. Let z = (zi, . . . , z^) 
be the coordinates of K*, I the ideal of V/G under this embedding, K\V/G] its coordinate ring. 
Then K\V/G] = K[z\/I, and we have the comorphism vr^^^ : K\V/G] K\V] given by 

Since fi are homogeneous, A'[y/G] is a graded ring, with the grading given by deg(zj) = 
deg(/j). Furthermore, t^y/q gives the isomorphism between i(r[y/G] and and we have 

7T*y^^{K[V/G]) = K{Vf. 

Theorem 2.9 (Mumford [MFK] ) (cf Theorem 1.1 in \MFK^ and Theorem 4.6 and 4.7 in 

(a) The image of -Ky/Q is already closed. Hence the map ttv/g -V^V/Gis surjective. 

(b) For any x G V/G, 7ry^^,(x) contains a unique closed G -orbit. 

(c) For any G-invariant subvariety W '^V, T^v/Gi^) ^ closed subvariety of V/G. 

(d) Given v,w £ V, the closures of the G-orbits of v and w intersect iff r{v) = r{w) for all 



r 



e K[V] 



G 



The following is a graded variant of Noether's normalization Lemma implicit in its proof; 
cf. Theorem 13.3. in |DEP2j . corollary 2.29 in |Mmj . and also the proof of Theorem 1.5.17 in 

Lemma 2.10 (Graded Noether Normalization) (a) Let fi, . . . , ft be any non-constant ho- 
mogeneous generators of K[V]'-' , and H C A'[y]'-' any set of homogeneous elements such that, 
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letting I{H) denote the ideal generated by H , /|^' G I{H) for some positive integer e^, for every 
i. Then K\y]^ is integral over the suhring generated by H. Analogous result holds for any 
positively graded affine K-algebra R in place of K\V]^ . 

(b) Let E be any set of homogeneous elements in K[V/G] such that (V/G) n Z{E) = {0}. Here 
V/G is embedded in as above, Z{E) C denotes the zero set of E, and denotes the origin 
in . Then K\V/G] is integral over the subring generated by E. 

If dj's are positive integers so that gi = /^^, . . . ,gt = have the same degree, then any H, 
\H\ > dim(i^[y]'^), consisting of random (generic) hnear combinations of gi^s has the property 
in Lemma [2.101 A set H of homogeneous invariants of cardinahty equal to dim(i('[y]*^ such that 
is integral over the subring generated by H is called an h.s.o.p. (homogeneous system of 
parameters) of ^[y]^. 

2.2.1 Separating invariants 

Following Derksen and Kemper |DK| (cf. Section 2.3.2 therein), let us call a set 5* C 
separating if for any v,w £ V such that r{v) ^ r{w), for some r £ K[V]^, there exists an s G 5 
such that s{v) / s{vu). 

Theorem 2.11 (Derksen, Kemper) (cf Theorem 2.3.12 in JD^) Let S C K[V]'^ be a finite 
separating set of homogeneous invariants. Then K[V]'-' is integral over the subring generated by 
S. 

Proposition 2.12 Suppose K[V]'^ is generated by a set homogeneous invariants of degree < D. 
Then K[V]'^ has a separating set S of homogeneous invariants of cardinality < 2nD. 

Proof: Write = V x V for the product of two copies of V. Let v = {vi, . . . ,Vn) and 
vu = {wi, . . . ,Wn) be the coordinates of the two copies. Let F = {fi{v), . . . , ft{v)} be a set 
of non-constant homogeneous generators of of degree < D. For any 1 < j < I?, let 

Rj[V,G] C be the subring generated by Hj = {hi{v,w) = fi{v) — fi{w) \ deg(/j) = 

j]. Clearly d\m{Rj[V,G]) < 2n. Let Xj[V,G] = spec{Rj[V, G]) . By Noether's Normalization 
Lemma [E] (Lemma 12. lOp . each Rj[V,G] has an h.s.o.p. Ej, \Ej\ = dim{Rj[V,G]) < 2n, whose 
each element is a linear combination of /ij's in Hj. Let E = ^jEj. Each element of E is of 
the form g{v) — g{w) for some g{v) that is a linear combination of fi{vys of the same degree. 
Let S = {g \ g{v) — g{vu) S E}. Then 5 is separating. To see this, let a, 6 S ^ be two points 
such that r(a) r{b) for some r G K{V)^. Then fi{a) ^ fi{b) for some 1 < i < t. Suppose to 
the contrary, that g{a) = g{b) for all g £ S. This means every element of E vanishes at {a,b). 
Since fi{v) — fi{w) is integral over the subring generated by E, it follows that /«(«) = fi{b); a 
contradiction. Q.E.D. 

2.3 GRH and solving polynomial equations 

In this section we state the known implications of GRH in the context of solving polynomial 
equations and their consequences that we need in this paper. 
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Let /i, . . . , /s £ . . . , be integral polynomials of degree d > n and weight < w. Here 
the weight of a polynomial is defined to be the sum of the absolute values of its coefficients. 
Suppose the system of polynomial equations 

(5):/i=0,...,/, = 

is solvable over C. Let 7r(x) denotes the number of primes < x and irs{x) the number of primes 
p < X such that (S) is solvable over the finite field Fp. 

Theorem 2.13 (Biirgisser) (cf. Theorem 4-4- f-Btt) /) Assuming GRH, t:s{X) > -^x^ — 
x^/'^log{wx) for all systems (S) solvable over C. 

Theorem 2.14 (Koiran: Hilbert's Nullstellansatz is in PH assuming GRH) (cf. IKol^ ) 
Assuming GRH, the problem of deciding if a given system of multivariate integral polynomials, 
specified as straight-line programs, has a complex solution belongs to RP^^ C 112. 

This result is stated in |Kolj for a sparse representation of polynomials. But it can be seen 
to hold for a representation of polynomials by straight-line programs as well. 

Lemma 2.15 (Noether NormaUzation is in PH for the long representation assuming GRH) 

Let K he an algebraically closed field of characteristic zero. Let Z C JiT* he the variety con- 
sisting of the common zeroes of a set of homogeneous integral polynomials fi{z), . . . , fs{z), 
z = {zi, . . . , zt). We assume that fi's are specified as straight-line programs. Let N denote 
the total bit-length of the specification of fi 's. Let dim(Z) he the dimension of Z. 

(a) Consider random linear forms Lr{z) = bk,rZk, < r < where bk^r 's are random integers 
of large enough poly{N) hit-length. Let Hj. C be the hyperplane defined by Lr{z) = 0. // 
s < dim(Z), then Znf]^.Hr / {0}. If s = dim(Z), then with high probability, ZCif^^Hr = {0}, 
which implies (Lemma \2.10\) that the homogeneous coordinate ring of Z is integral over the 
subring generated by L^iz) 's. 

(h) The problem of computing the linear forms Lr{z)'s such that Znf^,^Hr = {0} belongs to 
PSPA CE. This means the specifications of such linear forms can be computed in poly{N) work 
space. 

(c) Assuming GRH, this problem is in RP^^ C 112. 
Proof: 

(a) Let us assume that s = dim(Z), the other case being easy. Let d = max{deg(/j)}. Clearly, 
d < 2^ . By raising /j's to appropriate powers, we can assume that all of them have the same 
degree D<2^ . Consider generic combinations of /j's and generic linear forms: 

^i(^) = T^iVijIiiz), ^<j<t- dim(Z), 
I^r{z) = J2k Wk,rZk, I <r < dim(Z), 

where 2/i,j's and Wk^s are indeterminates. Let R denote the multivariate resultant of Fj^s and 
Li's. It is a polynomial in yij's and Wk/s of degree < D^. By Noether 's Normalization Lemma 
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(cf. Corollary 2.29 in |Mmj and Lemma I2.10p . the system of equations ([9]) has only {0} as 
its solution for some rational values for y^j's and Wk^s. Hence R is not identically zero as 
a polynomial in yj,/s and Wkfs. By the Schwarz-Zippel lemma [Sc], we can specialize yij's 
randomly to some rational values of 0(log(-D*)) = poly(A^) bit-length so that the resulting 
specialization R' of R is not identically zero. Then R' is a nonzero polynomial in Wk^s of degree 
< D*. By the Schwarz-Zippel lemma again, R' does not vanish identically if we let w^^r = ^fc,r for 
randomly chosen integers of bit-length 0(log(D*)) = poly(A^). For such b^ys, Znf]j.Hr = {0}. 

(b) and (c): First, let us assume that we know dim(Z). Let s = dim(Z). Choose bk^s as 
above randomly of large enough poly(A^) bit-length and test if Z n f]^. Hr = {0}. The latter 
test can be done in polynomial space since Hilbert's Nullstellansatz is in PSPACE \Ko\ \Kol\ 
ICnj (unconditionally): choose random yij's and test if the resultant R above, which can be 
computed in polynomial space |Cnj . is zero. Randomization can be removed since RP SPACE = 
N PSPACE = PSPACE. Assuming GRH, this test can be done by an iJP^^-algorithm, since 
Hilbert's Nullstellansatz is then in RP^^ (Theorem EUD . 

The problem that remains is that we do not really know dim(Z) a priori. So we start with a 
guess s for dim(Z), starting with s = and increasing it one by one. At each step we randomly 
choose large enough bk^s and ?/i,j's and test if R is non-zero. As long as s < dim(Z), R will 
always be zero. We stop as soon as we find that R is not zero. Randomization can be removed 
if we only want a PSPACE-algorithm. Q.E.D. 

2.4 Succinct vs. long representation 

Lemma 12.151 shows that the problem of constructing an h.s.o.p. for the coordinate ring of a 
variety Z is in PSPACE unconditionally and in PH assuming GRH if Z is specified by writing 
down a set of generators for its ideal- we call this a long representation of Z. It does not show that 
this problem is in PSPACE or PH (assuming GRH) for the succinct representation of explicit 
varieties used in this paper. For example, V/C in Theorem 11.11 is specified by just giving m 
and r in unary, and V/G in Theorem 11.41 is specified by just giving n and m (in unary) and the 
multiplicities (in unary) of the various Weyl modules Va(G)'s in the decomposition of V . The 
bit-lengths of the long representations of these varieties are exponential in the bit-lengths of their 
succinct representations. This is because the dimension t of the ambient space containing 
V/C is exponential in the bit-length of the succinct representation. 

Hence, Lemma 12.151 can only be used (Proposition 13. ip to show that the problem of con- 
structing an h.s.o.p. for in Theorem 11.11 is in EXPSPACE unconditionally and in the 
exponential hierarchy [not polynomial) assuming GRH. Similar result also holds for any variety 
for which explicit defining equations are known akin to the ones provided for V/C in Theorem ll.il 
by the second fundamental theorem for matrix invariants [Prl IRzj ; cf . Theorem 110.71 (e) . 

For the varieties for which such explicit defining equations are not known, e.g. V/C in 
Theorem 1 1 . 3 1 or [1 . 41 Lemma 12.151 can only be used to show that the problem of constructing an 
h.s.o.p. is in EXPSPACE; cf. Proposition 18.21 This is because for such varieties the conversion 
of the succinct representation into a long representation itself takes exponential space (in the 
bit-length of the succinct representation) and double exponential time at present; cf. Lemma [8.31 
In a nutshell, this is why the existing techniques for Noether normalization applied to V/C in 
Theorem 11.41 take exponential space and double exponential time even when m is constant; cf. 
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Proposition 18. 2[ The existing techniques also take exponential space and double exponential 
time for constructing a small separating set for ^[1/]^ for similar reasons; cf. Proposition 13.21 

3 Implications of black-box derandomization of STIT 

In this section we prove Theorem ll.il (a), (b), (c), (d) (1), (e), (f). 

Let K be an algebraically closed field of characteristic zero. Let V = Mm{Ky' , with the 
adjoint action of G = SL„i{K), be as in Theorem 11.11 Let n = dim(y) = rm?. Let Ui, . . . ,Ur 
be variable mx m matrices. Identify the coordinate ring K[V] of V with the ring K[Ui, . . . , Ur] 
generated by the variable entries of Ui^s. Let C K[V] be the ring of invariants with 

respect to the adjoint action. We shall specify V and G succinctly by the pair (m, r) in unary. 
The goal is to construct a small separating set 5 C J^[y]'^ of homogeneous invariants efficiently. 

For any word a = ii, . . . ,ii, I < m^, ij £ [r], let 

r«(C/) =trace(C/i, •••[/,,), (10) 

where U = {Ui, . . . , Ur). Let 

F = {T^^0)}, (11) 

where [a] ranges over the equivalence classes (under circular rotation) of all words in 1, . . . ,r of 
length < m^. Then F generates iir[y]*^ by Theorem 12.51 

Let t = \F\ and let -Z[q,]'s denote the coordinates of A'*. Consider the map ttv/g '■ ^ ~^ -^S 

TTv/G- A=(Ai,...,A)^(...,r[,](^),...). (12) 

Then V/G is the closure of the image of this map. By Theorem 12.91 this image is already closed. 
Hence V/G = Imliry/Q) = Imliry/Q). Furthermore, 7ry^^{z[a]) = T[q,](C/) and deg{z[a]) = 
deg(r[,] ([/)). 

3.1 Construction of an h.s.o.p. 

The standard techniques for Noether normalization applied to V/G embedded in via iTy/Q 
imply the following. 

Proposition 3.1 The problem of constructing an h.s.o.p. for K[V]'^ belongs to EXPSPACE 
unconditionally. (The space is exponential in the bit-length of the succinct specification.) As- 
suming GRH, it belongs to REXP^^ . (Exponentially long oracle queries are allowed.) 

The hierarchy here is exponential and not polynomial because the dimension t of the ambient 
space containing V/G is exponential in m. The space requirement of this algorithm remains 
exponential even if we only want to construct an S of poly(n) size (not necessarily optimal) such 
that if [y]'"' is integral over the subring generated by 5. 

Proof: Let V/G be the categorical quotient embedded in if*, i < r"^^, as in eq.([T2]). Theorem 12. 81 
gives a set of exponentially many generators {gi,g2, • • •} for the ideal of V/G C if* whose spec- 
ifications as straight-line programs can be computed in poly(A^) time, where = 0(r*^("^ )) = 
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(9(2P° is the total bit-length of the straight-line programs. Applying Lemma 12.151 to 
these generators, we compute a set S C of invariants of optimal cardinality (equal to 

dim{V/G) < n) such that Er[V^]'^ is integral over the subring generated by S. Then S is an 
h.s.o.p. This is an EXPSPACE algorithm that works in poly(A^) = 2P°^y("') work-space uncon- 
ditionally. It is an iJ^XP^^-algorithm assuming GRH. Q.E.D. 

The proof of the above result implies that has an h.s.o.p. with exponential total 

bit-length of specification. It is not known if iir[y]*^ has an h.s.o.p. with sub exponential bit- 
length of specification. As such, it seems difficult to go any further than Prop 13.11 as far as the 
construction of an h.s.o.p. is concerned. 

Grobner basis theory yields the following result for the construction of a small separating 

set. 

Proposition 3.2 A small separating set S of cardinality < 2nm? can he constructed for K[V]^ 
using space that is exponential in n and time that is double exponential. 

The space and time bounds for this algorithm remain the same even if we only require \S\ 
to be poly(n). The overall bound does not change assuming GRH. This algorithm can also be 
modified to yield a separating homogeneous S of degree < 2m^ with optimal cardinality. The 
space and time bounds remain the same. 

The article |Kej . specialized to this setting, yields an algorithm for constructing a separating 
set for J^[y]'^ that works in time that is exponential in n. But the cardinality of the separating 
set constructed by this algorithm is also exponential in n. 

Proof: This follows by a straightforward constructivization of the proof of Proposition 12.12] using 
Grobner basis theory. Let F in that proof be the set of generators of degree D < m? given by 
Theorem 12.51 Let = spec(i2j[y, G]), I < j < D = m"^, and the set Hj of generators of 

Rj[V,G] be as in the proof of Proposition 12.121 The set Hj can be used to embed G] in 

tj = \Hj\. Unlike in the proof of Proposition 13. 11 where we knew explicit defining equations 
for V/G, we do not know explicit defining equations for G]. Equations for G] can 

be constructed in space that is exponential in n and time that is double exponential in n using 
Grobner basis theory (specifically, using Theorem 1 in [MR2] ) . The space bound is exponential 
in n, since tj is exponential in re. After this, we can construct an h.s.o.p. Ej of Rj [V, G] using 
Lemma 12.151 in exponential space. The set S = {g \ g{v) — g{w) G E}, E = DjEj, is then a 
small separating set. Q.E.D. 

3.2 Efficient construction of an s.s.o.p. 

By Theorem 13.61 below, the double exponential time bound for the construction of a small 
separating S in Proposition 13.21 can be brought down to polynomial, assuming black-box deran- 
domization hypothesis for STIT, and requiring |5| to be only 0(poly(re)) (instead of optimal or 
< 2nm^). We need a few definitions before we can state the result. 

Definition 3.3 We call a set S C an s.s.o.p. (small system of parameters) for K[V]'-' if 

(1) S contains poly{n) homogeneous invariants of poly{n) degree, (2) K\y]^ is integral over its 
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subring generated by S, (3) each invariant s = s{Ui, . . . , Ur) in S has a weakly skew straight-line 
program \MF^ over Q of poly{n) bit-length. 

We say that S is a separating s.s.o.p. if (2) is replaced by the stronger (2)': S is separating 
( cf. Theorem \2.11\) . 

A (separating) subexponential s.s.o.p. with index 5 > Q is defined by replacing the poly[n) 
bounds by 0{2^^'^ ^) bounds. 

We are not requiring here that the size of S be optimal (equal to dim(ii'[y]*^)). Given a 
weakly skew straight-line program for any s £ S and any rational matrices Ai, . . . , Ar G M„(Q), 
the value s(^i, . . . , A^) can be computed in time polynomial in n and the total bit-length of the 
specifications of Ai^s, and even fast in parallel by an AC^ circuit with an oracle access to DET 
[MPlICi]. 

We say that Noether's normalization lemma for is derandomized if there exists 

an explicit system of parameters (e.s.o.p.) for as defined below. We say that it is 

derandomized in a strong sense if there exists a separating e.s.o.p. for 

Definition 3.4 We say that S C is an e.s.o.p. (explicit system of parameters) if (1) 

S is an s.s.o.p., and (3) given m and r, the specification of S, consisting of a weakly skew 
straight-line program as in Definition \3.S\ for each s £ S, can be computed in poly{n) time. 

We say that S a separating e.s.o.p. if (1) is replaced by the stronger (1)': S is a separating 
s.s.o.p. 

A (separating) subexponential e.s.o.p. with index 6 > is defined by replacing the poly{n) 
bounds by 0(2'^("*)) bounds. 

The following is the full statement of Theorem ll.il (b) (1). 

Theorem 3.5 Suppose S is a separating e.s.o.p. for K\y]^ . Let tjjs '-V ^ , k = \S\, denote 
the map 

il}S ■■ V ^ {si{v),. . . ,Sk{v), 
where si, . . . ,Sk are the elements of S. It can be factored as: 

V V/G ^ K\ (13) 

(1) Given a rational v £ V, ips{'^) can be computed in time that is polynomial in n and the 
bit length of the specification of v. If v is not rational then ipsi'f) can be computed in poly[n) 
operations over K . 

(2) The image V'S'(^) ^ is a closed subvariety. The map from V/G to V'5(^) is one-to- 
one and onto. 

(3) For any x E il)s{y), ipg^{x) contains a unique closed G-orbit in V. 

This result holds for any finite dimensional rational representation V of G. The definition 
of an e.s.o.p. in this general setting is given later (Definition 18. 4p . 
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Proof: 

(1) This follows because S is an e.s.o.p. 

(2) By Theorem 12.91 (a), the first map is surjective. The image of the second map is closed 
because -fC[y/G] = -fC[F]'^ is integral over the subring generated by 5. The map ip'g is one-to- 
one because S is separating. 

(3) This follows from (2) and Theorem [2J] (b) . Q.E.D. 

The following is the full statement of Theorem ll.il (b) (2) and (c). 

Theorem 3.6 Let K he an algebraically closed field of characteristic zero. 

(1) Assume that the black-box derandomization hypothesis for STIT (symbolic trace identity 
testing) over K holds. Then K[V]'^ has a separating e.s.o.p. 

(2) Suppose STIT for m x m matrices and r variables has 0{2^^) -time- computable black-box 
derandomization, n = rrn^ , for any small constant e > 0. Then K\y\^ has a separating subex- 
ponential e.s.o.p. for any exponent 5 > 0. 

(3) The result in (1) holds, after replacing the poly{n) bound by 0(n^°S") bound, if we assume 
black-box derandomization of PIT for depth four circuits over K instead of black-box derandom- 
ization of STIT. 

(4) Assuming parallel black-box derandomization of STIT over K (cf. Section \2.1\) . the problem 
of constructing a separating s.s.o.p. for K\V]'^ belongs to DET C NC"^ C P. 

Here DET denotes [C] the class of functions that can be computed by AC^ circuits (of 
poly(n) bit-size) with oracle access to the determinant function. 

Proof: We only prove (1). The proof of (2) is similar, and so too the proof of (3), since by [AV] . 
black-box derandomization of PIT for depth four circuits over K implies 0(n'°^ "j-computable 
black-box derandomization of low-degree PIT, and hence, STIT. The proof of (4) is similar to 
that of (1) using the parallel black-box derandomization hypothesis in place of the sequential 
hypothesis. 

For any I < k := m^, consider the generic invariant 

Ti{X, U) = trace(Xi ®Ui + --- + Xr® Ur)\ (14) 

where XiS are new kxk variable matrices, X = {Xi, . . . , X^), U = {Ui, . . . , C/,.), and (8) denotes 
the Kronecker product of matrices. Thus each Xi(^Ui is an m'xm' matrix, where m' = km = ?n^, 
and for any A = (Ai, . . . ,Ar) £ Mjn{KY , Ti{X, A) is a symbolic trace polynomial over an m'xm' 
matrix in the rk'^ entries of Xi's. We have 

Ti{X,U) = Y,To.{X)T^iU) = Y,M\T[o.]{X)T^a]{U), (15) 

where [a] = [ai ■ ■ ■ ai] ranges over the equivalence classes of all words of length / with each 
aj E [r] , and | [a] \ denotes the cardinality of [a] . 

Let U' = {U'l, . . . , [/',) be another tuple of variable m x m matrices. For each I < k = in^, 
define the symbolic trace difference 
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fi{X, U, U') = Ti{X, U) - Ti{X, U'). 



(16) 



Each Ti{X, U, U') can be expressed as trace(A'';(X, U, U'Y') for some symbolic matrix Ni{X, U, U') 
of size q = poly(n), r/ < q, whose entries are (possibly non-homogeneous) linear functions over 
Q of the entries of X, U, and U' . This is because STIT is equivalent to PIT for algebraic branch- 
ing programs (cf . |MPj and Section I2.ip , and the difference between two branching programs is 
again a branching program. 

By our black-box derandomization hypothesis for STIT, there exists an explicit (poly(n)- 
time computable) hitting set B = Bs^q C for STIT for qx q matrices whose entries are linear 
functions of the s = rk"^ variable entries of Xis with coefficients in K. Fix such an explicit B. 
We think of each 6 G i? as an r-tuple h = (bi, . . . ,br) oi kxk integral matrices. By the definition 
of the hitting set and the argument in the preceding paragraph, for any symbolic trace difference 
fi{X, A, A') = Ti{X, A) - Ti{X, A'), l<k,A = (^i, . . . , Ar),A' = (^i, . . . , ^;) e M^(i^)^ that 
is not identically zero as a polynomial in the s variable entries of XiS, there exists b £ B such 
that fi{b,A,A') / 0. 

Let 

S = {Ti{b,U) \ b e B,l < I < k} C K[Vf. (17) 

Suppose A, A' G Mm{K) are two r-tuples such that for some invariant h G h{A) ^ 

h{A'). By Theorem 12.51 it follows that some generator T^^j{U) assumes different values at A and 
A'. By eq. ([E]) and Corollary [221 this implies that fi{X,A,A') = Ti{X,A) - Ti{X,A') is not 
identically zero for some I < m?. This means there exists b G B such that Ti{b, A, A') ^ 0; i.e., 
Ti{b,A) ^ Ti(b,A'). It follows that 5 is separating. By Theorem ETH it follows that K[V]^ is 
integral over the subring generated by S. Every element of S is clearly homogeneous of poly(n) 
degree. Since the hitting set B is explicit, and matrix powering, Kronecker product, and trace 
have short and explicit weakly-skew straight-line programs PiMP]. it follows from eq. that 
the specification of S consisting of a weakly skew straight-line program for its every element can 
be computed in poly(n) time. Hence 5 is a separating e.s.o.p. Q.E.D. 

For future reference, we note down the following consequences of the proof. 

Lemma 3.7 Let X = {Xi, . . . ,Xi^) be new k x k variable matrices, where k = m?. There exist 
poly{n)-time computable weakly skew circuits Ci's, I < iv? , over the variable entries of Xi's and 
Ui's such that (1) the polynomial functions Ci{X,U)'s computed by Ci's are of poly{n) degree, 
homogeneous in X and U, and can be written as 

CliX, U)=Y, fla]A^)9[a]AU), (18) 

[a] 

where [a] = [qi ■ ■ ■ ai] ranges over the equivalence classes of all words of length I with each 
^ ['']> (^) f'[a] ^'^^ linearly independent homogeneous polynomials in the entries of X, and 
(3) g[a],i{U)'s are homogeneous invariants that generate K[V]'^. 

This result, a key ingredient in the proof of Theorem 13. 6|, says that the variety V/G here 
is strongly explicit as per the general notion of explicit varieties that we shall formulate later 
(Definition 110. 2p taking V/G as a basic prototype of such varieties. 
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Proof: Let Ci be a weakly skew circuit [MPj computing Ti {X, U) , cf . eq. (fH|) , so that Ci {X, U) = 
Ti{X,U), = |[a]|r[,](X), and <7[,y(C/) = T[,](C/); cf. eq. (US]). Then /[„]X^)'s are 

linearly independent by Corollary 12.71 and g[a],i{Uys generate -ftr[y]'^ by Theorem 12.51 Q.E.D. 

Theorem 3.8 The problem of deciding if the closures of the G-orbits of two rational points in 
V intersect belongs to co-RNC . 

Proof: By Theorem l2. 91 (d) and the proof of Theorem l3.6| the closures of the G-orbits of A,A'&V 
intersect iff the symbolic trace difference Ti{X, A, A') = Ti{X,A) — Ti{X,A') is identically zero 
for some I < k = rr? . For rational A and A', this can be tested by a co-RNC algorithm |IMj : just 
substitute large enough random integer values for the entries of X and test if all the differences 
vanish. Q.E.D. 

3.3 Existence of an s.s.o.p. 

The following is the first half of Theorem 11.11 (a) . 

Theorem 3.9 The ring K[V]'^ has a separating s.s.o.p. 

Proof: Observe that STIT is a special case of the PIT treated in Theorem 12.11 In the proof of 
Theorem 13. 6| use the new hitting set B provided by Theorem 12.11 letting s, r, and d there be 
poly(n) (large enough), in place of the old hitting set B provided by the black-box derandom- 
ization hypothesis for STIT. Let S be as in (I17p with this new B. 

Since we do not have any efficient algorithm for constructing B anymore, we do not have 
any efficient algorithm for constructing S either. Hence the proof of Theorem 13.61 will only show 
now that S" is a separating s.s.o.p. instead of a separating e.s.o.p. Q.E.D. 

Let us note down the following consequence of the proof. 

Lemma 3.10 Let B be any hitting set provided by Theorem \2.1\ (with s, r, and d as in the proof 
of Theorem \3.9\) . Let S be defined as in |i7[ ) with this B. Then S is a separating s.s.o.p. 

The following result proves the second half of Theorem ll.il (a) and Theorem ll.il (d) (1). 

Theorem 3.11 Let V and G be as in Theorem \3.(A A separating s.s.o.p. can be constructed 
in poly{n) work- space. This means the problem of constructing a separating s.s.o.p. belongs to 
PSPACE unconditionally. The problem belongs to S3 C PH assuming GRH. 

For the proof we need the following lemma. 

Lemma 3.12 Let B he any potential hitting set in the setting of Theorem \2.1\ (with s, r, and d 
as used in the proof of Theorem \3. 9\) . Let S{B) be as in jiTp with this B. Then whether S{B) is 
a separating s.s.o.p. can be verified in poly{n) space. Assuming GRH, the problem of verification 
belongs to 11^. 
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Proof: Let B be as given. Then 

S{B) = {Ti{b, U)\beB,l<l<k}C K[Vf. 

Each element of S{B) is homogeneous of poly(n) degree, has a weakly skew straight-line 
program of poly(?i) bit-length, and the cardinality of S{B) is poly(n). So it suffices to check 
that S{B) is separating. 

Let = V xV. Let [/ = ([/(,... , U'.) be another tuple of m X m variable matrices whose 
entries are the coordinates of the second copy of V. Let Z(B) C V'^ be the zero set of 

S{B) = {Ti{b, U) - Ti{b, U')\b€ B,l<l<k}C K[V^]. 
Let F = {T\^a] (U)} be the set of generators for as in eq. ([TT]) . Then S{B) is separating 

iff 

(*) For every equivalence class [a] of words in 1, ... ,r of length < m^: (**) T[^-\{U) — T^a]{U') 
vanishes on Z{B). 

Using the PSPACE-algorithm for Hilbert's NullsteUansatz \Ko\ IKolj . the test (**), for any 
given a, can be carried out in poly(n) space. Using the PH-algorithm for Hilbert's NullsteUansatz 
in Theorem 12. 141 it can be carried out by a n2-algorithm assuming GRH. Thus the test (*) can 
be carried out by a PSPACE algorithm unconditionally, and by a n2-algorithm assuming GRH. 
Q.E.D. 

Proof of Theorem \3.11\ 

Using Lemma |3.12| the problem of checking if there exists B C [n]'' (with s,r,d, and u = 
2s{d + 1)^ as in the proof of Theorem 13. 9p such that S{B) is a separating s.s.o.p. belongs to 
PSPACE unconditionally, and to S3 assuming GRH. The algorithm for this check is bound to 
succeed by Theorem 13.91 Q.E.D. 

3.4 Instability flag and decomposition 

In this section we prove Theorem 11.11 (e) and (f). Let V, G, and K be as in Theorem 13. 6[ 
The following is a full statement of Theorem ll.il (e) and (f). 

Theorem 3.13 Let K be an algebraically closed field. Let V and G be as in Theorem \L1[ 

(a) Given a rational v £ V, the instability flag \Km!j (j){v) of an optimal one-parameter subgroup 
that drives v to a point in the unique closed orbit that lies in the closure of the G-orbit of v can 
be computed in time that is polynomial in n and the bit-length of the specification of v. If v is 
not rational, this flag can be computed in poly{n) operations over K. 

(b) Given a rational semi-simple v ^V, the decomposition of v into simple tuples can also be 
computed in time that is polynomial in n and the bit-length of the specification of v. If v is not 
rational, this decomposition can be computed in poly{n) operations over K. 

The instability flag is specified by the coordinates of the vectors constituting the fiag. All 
coordinates are specified by straight-line programs over some variables Ai, . . . , A^, /c = poly(n), 
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where each Aj stands for an algebraic number with the minimal polynomial over Q of poly(n) 
degree. Each Aj is specified by its minimal polynomial and a complex approximation to it of a 
high enough precision to distinguish it from other Aj's with the same minimal polynomial. We 
cannot specify the coordinates of the vectors as elements of a common extension field containing 
Aj's, since the degree of such a field can be exponential. 

Proof: The point v corresponds to a tuple {Ai, . . . , Ar) of rational m x m matrices. Consider 
the representation 

: C{Ui,...,Ur)^MmiK), 

given by Ui — t- Ai. Two representations (j)y and (j)w are isomorphic iff v and w belong to the 
same G-orbit. By Artin |Arj . closed orbits correspond to semi-simple representations and the 
representation corresponding to the unique closed orbit in the closure of the G-orbit of v is 
the semi-simplification of (py, i.e., the direct sum of its Jordan-Holder components. The flag 
4>{v) corresponding to this Jordan-Holder series in the specification we are considering can be 
computed in polynomial time using the standard techniques for decomposing finite dimensional 
algebras over algebraically closed fields; cf. |FRj and Theorem 3.5. in [Ebj . In v is semi- 
simple, the Jordan-Holder components yield its decomposition into simple tuples. The standard 
techniques yield this decomposition as well in polynomial time. Q.E.D. 

3.5 Explicit coarse classification assuming black-box derandomization for any 
algebra 

Next we note down the consequences of Theorems l3.5l 13.61 and l3.13] for arbitrary finite-dimensional 
algebras. 

Let i? be a finite dimensional associative algebra over K, specified by its basis fi,...,fr and 
the multiplicative structure constants. Let p : R ^ Mm{K) be its m-dimensional representation. 
Let n = mr'^. Two representations are isomorphic iff they lie in the same G-orbit, G = SLm{K). 
The representations with closed G-orbits are called stable or semi-simple. Each m-dimensional 
representation p oi R can be identified with the r-tuple (^i, . . . , Ar) €V = Mm{KY of m x m 
matrices, Ai = p{fi). The set Wm = Wm{R) of m-dimensional representations of i? is a closed 
G-subvariety of V . 

The following result says that black-box derandomization of STIT implies explicit coarse 
classification (parametrization and decomposition) of the finite dimensional semi-simple repre- 
sentations of A. 

Theorem 3.14 Suppose black-box derandomization hypothesis for STIT holds. Then: 

(a) There exists a separating e.s.o.p. S = Sm for K[V]'~^ for any m. This yields a one-to-one 
and onto explicit (polynomial time computable) map il^s from the closed G-orbits in Wm to the 
points ofips{Wm) C K'', k = \S\ = poly{n). Furthermore, ipsiWm) is a closed subvariety of . 

(b) Each p € Wm can be associated an explicit (polynomial time computable) instability flag (f){p) 
of an optimal one-parameter subgroup driving p to the unique closed G-orbit in the closure of 
the G-orbit of p. 

(c) Given a rational semi-simple p G Wm, the decomposition of p into simple representations 
can also be computed in time that is polynomial in n and the bit-length of the specification of p. 



31 



If p is not rational, this decomposition can be computed in poly{n) operations over K. 



This result follows from Theorems 13.61 13.131 and the following result. 
Proposition 3.15 Suppose S is a separating e.s.o.p. of K\V]^ . Then: 

(a) Given any rational m- dimensional representation p of R, ipsip) can be computed in time 
polynomial in n and the bit-length of the specification of p. If p is not rational, then ips{p) can 
be computed in poly{n) arithmetic operations over K . 

(b) The image W^^ = ipsiWm) is a closed subvariety of , k=\S\= poly{n). 

(c) For any x G W^, ^|Jg^{x) contains a unique closed G-orbit in Wm- 

Proof: 

(a) This follows since S is an e.s.o.p. 

(b) By Theorem 12.91 (c), Y = TTv/GC^m) is a closed subvariety of V/G. The image ijj'g{Y), with 
ip'g as in eq. ([H]), is closed since K[V/G] = K[Vf is integral over the subring generated by S 
and hence ip'g is a finite morphism. 

(c) This follows from Theorem 13.51 (3). Q.E.D. 



4 Generalization to quivers 

We now generalize Theorems 13.61 and 13.81 to arbitrary quivers. 

Let Q be a quiver \BP\ IDW] . i.e., a four-tuple {Qo,Qi,t, h), where Qo = {1, . . . ,1} is a set 
of vertices, Qi is a finite set of arrows between these vertices, and the two maps t,h : Qi ^ Qq 
assign to each arrow cp G Qi its tail t{(j)) and head h{(j)). Loops and multiple arrows are allowed. A 
representation W of the quiver Q over a field K is a family : i G Qq} of finite dimensional 

vector spaces over K together with a family of linear maps W{(j)) : W{t{(j))) — )• W{h{(j))), 
(j) € Qi- The /-tuple of integers dim(VF) = {dim(VF(z)) | i G Qq} is called the dimension vector 
of VF. A morphism between two representations / : Wi — )• W2 is a family of linear morphisms 
{/(i) : Wi{i) ^ W2{i^ I i e Qo} such that, for ah G Qi, W2{^) o = o Wi{^). 

For a fixed dimension vector m = (m(l), . . . , m{l)) G N', the representation space V = V{Q, m) 
of the quiver Q is the set of all representations W oi Q such that W{i) = for all i G Qq. 

Clearly, 

V = V{Q,m) = ©<^eQiHom,,(E:-(*W),E:-('^W)) = ®4,^q,M^{K), (19) 

where M^[K) denotes the space of m{h{(f))) x m[t[(j))) matrices with entries in K. There is a 
canonical action of 

I 

i=l 

on V defined by 

{g - wm = g{hmW{<P)g{tm-^ 
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for any g = {g{l), . . . , g{l)) G G and W G V{Q,'m). The G-orbits in V are precisely the 
isomorphism classes of representations of Q with the dimension vector m. 

Assume that K is of characteristic zero. Let U = (. . . ,U^, . . .) be the tuple of variable 
matrices, where is a m{h{cj))) x m{t{(j))) variable matrix. Then the coordinate ring K\y] of 
V can be identified with the ring K[U] over the variable entries of C/^'s. Let K[V]^ C K[V] be 
the subring of G-invariants. When Q consists of a single vertex with r self-loops and dimension 
m, coincides with the invariant ring in Theorem 13.61 If Q has no directed cycles then 

= K. So we are mainly interested in the case when Q has directed cycles. 

The following result generalizes Theorem 13.61 to arbitrary quivers. 

Theorem 4.1 Let K he an algebraically closed field of characteristic zero, andn = \Qo\ + \Qi\ + 
\m\, where \m\ = X^igg^ Then: 

(a) The analogue of Theorem \3.6\ holds for V and G as above. 

(b) Analogues of Theorems 13.^1 [XPt and \3.1I\ also hold. 

For a proof, we need the following generalization of Theorem 12.51 

Theorem 4.2 (Le Bruyn-Procesi) WP^ The ring KlV]'-' is generated by traces of the form 
trace{U^^ •••U^i), I < where {(pi, . . . ,(pi), (pj G Qi, denotes a directed cycle in the quiver 

Q. 

Proof of Theorem I4.lt 

(a) The proof is very similar to that of Theorem 13.61 So we only indicate the differences. 

For any cycle a = {(pi, . . . ,(pi), (pj G Qi, I < |mp, in the quiver Q, let Ta{U) = trace(C/0^ • • • C/^;). 
We call two cycles equivalent if one is obtained by a cyclic rotation from the other. Let [a] de- 
note the equivalence class of a, and let T^^t^{U) = Ta{U). The generalization of F in eq. (jlip is 
now 

F = {T[,](^)}, (20) 

where [a] ranges over the equivalence classes of cycles of length < \m\^. By Theorem 14. 2| F 
generates Let t = \F\. The map ttyiq : V — t- in (jl2p now generalizes to 

TiyiG-. A = {...,A^,...)^{...,Ty^^{A),...). (21) 

For any / < k := |mp, the generic invariant in (|14p now generalizes to 

Ti{X, U) = trace(M(X, ^7)'), (22) 

where X = {. . . , X^, . . .) is a tuple of new k x k variable matrices X^'s, M{X, U) denotes the 
\Qo\ X IQol block matrix whose {i,j)-th. block, i,j G Qo, is the matrix '^^X^f^ (g) U^, where (p 
ranges over all arrows in Qi with tail i and head j, and ® denotes the Kronecker product. For 
any A = {. . . ,A^, . . .) G V, Ti{X,A) is a symbolic trace polynomial over an m' x m' matrix, 
m' = dim{M{X,U)) = poly(n), in the s = ^^dim(X(^)^ = poly(n) variable entries of X^^s. 
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The rest of the proof proceeds as that of Theorem 13.61 with Ti{X, U) m (j22p m place of Ti{X, U) 
in dm). 

(b) The proof of (a) can be extended just as the proof of Theorem 13.61 is extended to prove 
Theorems ESI [321 and Em Q-E.D. 

Conjecture 4.3 Analogue of Theorem \3.13\ (a) also holds for V and G as in Theorem \4-l\ 

The proof of Theorem l3.13l can not be extended as it is since a characterization of semi-simple 
representations of a quiver analogous to Artin's characterization |Ar| of semi-simple tuples is 
not known. 

5 Implications of lower bounds for EXP in characteristic zero 

In this section we prove Theorem ll. 21 (a). (b), and a half of (f). The following is its full statement. 

Theorem 5.1 Let K he any algebraically closed field of characteristic zero. Assume that V and 
G are as in Theorem \3.6l the case in Theorem \4-i\ being similar. 

(1) Suppose there exists an exponential-time- computable multilinear polynomial f{xi,...,Xr) 
with integral coefficients of poly{r) bit-length such that f can not be evaluated by an arithmetic 
circuit over K of size 0(2''°) and depth 0{r"'), for some constant a > 0. Then K\V]'^ has a 
separating quasi-e.s.o.p. 

(2) Suppose f as in (1) can not be evaluated by an arithmetic circuit over K of size 0{r"') and 
depth O(log^r), for any constant a > 0. Then K\y\^ has a separating subexponential e.s.o.p. 
for any index 5 > 0. 

(3) The implication in (1) also holds f there can not be evaluated by an arithmetic circuit over 
K of size Oi^l"" ) and constant depth, for some constant a' > 0. 

(4) The implication in (2) also holds if f there cannot be computed by an arithmetic circuit over 
K of constant depth and size 0(n*^(^'°S")). 

(5) The implications in (l)-(4) hold if f is instead exponential-time- computable with access to 
the N P-oracle, giving the algorithm for constructing S an access to the NP-oracle in this case. 

For a proof we need the following result. 

Theorem 5.2 (Agrawal,Vinay; Koiran) Let K be a field of characteristic zero. 

(a) [AgrawaljVinay], cf. JAV^ : A multilinear polynomial in r variables that can be computed by 
an arithmetic circuit over K of 2°^^^ size can also be computed by a depth four arithmetic circuit 
over K of 2"^'^^ size. 

(b) [Koiran], cf. \Ko^ : A multilinear polynomial in r variables that can be computed by an 
arithmetic circuit over K of poly{r) size can also be computed by a depth four arithmetic circuit 
over K o/ O(rO(v^i°g0) 

size. 

Proof of Theorem \5.1\ 
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(1) The lower bound assumption implies that / can not be evaluated by an arithmetic circuit over 

a' 

K of size 0(2*" ), a' = a/2, without any depth restriction. To see this, suppose to the contrary, 

that / can be evaluated by an arithmetic circuit C' over K of size s = 0(2'' ). Applying the 
parallelization technique in |VSBRj to C", it follows that / can be evaluated by an arithmetic 
circuit C over K of size poly(s) = 0(2'"'') and depth 0((logs + log(deg(/))) log s) = 0(r"); a 
contradiction. 

It now follows from Theorem 12.21 (a) that PIT for small degree circuits of size < s has 
Q^2Polylo§('^))-time computable black-box derandomization. In particular, STIT for m x m 
symbolic matrices and r variables has 0(2Polylog(^) 

)-time computable black-box derandomiza- 
tion, s = rm?. Now we can apply Theorem 13.61 (1), or rather its proof, with the following 
modification. In Theorem 13.61 (1) it is assumed that STIT has polynomial time computable 
black-box derandomization, rather than quasi-polynomial time computable black-box deran- 
domization as here. But it can be checked that the proof of Theorem 13.61 (1) goes through 
replacing polynomial bounds by quasi-polynomial bounds. 

(2) This follows similarly from Theorem 12.21 (b) and Theorem 13.61 (2). 

(3) By Theorem 15.21 (a), the assumption in (3) implies the assumption in (1). So (3) follows 
from (1). 

(4) By Theorem 15.21 (b). the assumption in (4) implies the assumption in (2). So (4) follows 
from (2). 

(5) The proof of (l)-(4) relativizes. Q.E.D. 

The following result is a consequence of the proof of Theorem 15. 11 (1) letting / in Theorem l2.2l 
(a) be the permanent function. 

Theorem 5.3 Let V,G, and K he as in Theorem \5.1[ Suppose perm{X), X a variable m x m 
matrix, cannot be computed by arithmetic circuits over K 0/0(2™°) size and 0{m°') depth for 
some constant a > 0. Let B be the hitting set for STIT for symbolic matrices of (large enough) 
poly{n) size and poly{n) variables constructed using the arithmetic NW-generator fNW\ \K]^ based 
on the permanent as a hard function. Then {Ti{b,U) \ b B,l < k = -m?}, cf. eq. (11), is a 
separating quasi-e.s.o.p. for K[V]^ . 



6 Implications of Boolean lower bounds for EXP 

In this section we prove Theorem 11.21 (c). (d), (e), and the second half of (f). The following is 
its full statement. 

Theorem 6.1 Assume that the Generalized Riemann Hypothesis (GRH) holds. Let K be any 
algebraically closed field of characteristic zero. 

Assume that V and G are as in Theorem \3.6\ the case in Theorem \4.1\ being similar. 

(1) Suppose EXP 2 i.o.nonuniform-NC[n'',2"'^], for some constant e> 0, or alternatively, that 
EXP 2 io.rO°[2"/"], for some constant a > 1. Then K[V]^ has a separating quasi-e.s.o.p. 

The prefix i.o. can be removed from the assumption and added to the conclusion. This means 
the conclusion then holds for infinitely many n. 
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(2) Suppose EXP % i.o.nonuniform-NC^ , or alternatively, that EXP ^ i.o.rC°[ri'^(v^^°*5")]. 
Then K[V]'^ has a separating suhexponential e.s.o.p. for any index 5 > 0. The prefix i.o. can 
he removed from the assumption and added to the conclusion. 

(3) Analogues of (1) and (2) also hold if we replace EXP by EXP^^ and give the algorithms 
for computing S an access to the NP-oracle. 

(4) If EXP 2 MA then there is a suhexponential time algorithm for constructing a separating 
suhexponential- s.s.o. p. for K\y]'^ , for any exponent (5 > 0, that is correct for infinitely many 
n. If EXP^^ % n n^, or NEXP ^ MA, then there is a suhexponential time algorithm 
for constructing a separating suhexponential- s.s.o. p. for K[V]^ , for any exponent S > 0, that is 
correct for infinitely many n, giving the algorithm for constructing S an access to the NP-oracle. 

For a proof, we need the following result based on Biirgisser (Theorem 12. 13p . 

Lemma 6.2 Let f he an n-variate polynomial of poly{n) degree with non-negative integral co- 
efficients of poly[n) hit-length. Let K he any field of characteristic zero. 

(a) Suppose f can he computed hy a nonuniform arithmetic circuit C over K of size s = 
assuming GRH, there exists a nonuniform Boolean circuit C2 of poly[s) 
size and 0(log^(s) log(n)) depth such that C2{x) = f{x) (mod 2), for all x G {0, 1}". 

(h) Suppose f can he computed hy a nonuniform constant- depth (unbounded fan-in) arithmetic 
circuit C over K of size s = 0(2P0^y(")). Then, assuming GRH, there exists a Boolean constant- 
depth threshold circuit C2 of poly{s) size such that C2{x) = f{x) {mod 2), for all x G {0, 1}" . 

Proof: 

(a) The proof is very similar to that of Theorem 4.5 in |Bu] . Suppose / can be computed by 
a nonuniform arithmetic circuit C over K of size s = 0(2?°^^^")). Without loss of generality, 
we can assume that C has 0(log^(s)) depth. Otherwise, using the parallelization technique in 
[VSBR] ■ C can be replaced with an equivalent circuit of size poly(s,n) = poly(s) and depth 
0((log s)(log s + log(deg(/)))) = 0(log^(s)) = 0(poly(n)). Let C be the circuit obtained from 
C by replacing each constant Cj G K in C by a new variable yi. Let y = {yi, . . . , yg) be the tuple 
of the newly added variables, and c = (ci, . . . ,Cg) the tuple of the replaced constants. Then 
C{x, y) is an integral function with the degree as well as the logarithm of the weight bounded by 
2Poly("). Clearly, C{x,c) = C{x) = f{x). Hence, the integral system of polynomial equations 
over the variables y 

{Sn): C{x,y)- fix) = for anxG{0,...,l}" 

has a solution y = c G K'^. By Hilbert's Nullstellansatz, also has a solution over the 

algebraic closure of Q and hence over C. Since the degree of f{x) is poly(?i) and the coefficients 
of f{x) are non-negative integers with poly(n) bit-length, < f{x) < 2"", for all x G {0, . . . , 1}"", 
for some constant a > 0. Assuming GRH, by Theorem 12.131 '''"5„(2" ) > 2"" for some constant 
6 > 0, ?i — )• 00. Hence, there is some prime pn satisfying n°- < log2(pn) < such that (S'„) is 
solvable over Fp^. Let y„ G Fp^ denote such a solution. Let Cp^ denote the circuit over Fp„ 
obtained by substituting y = y^ in C. Its size is poly(s) and depth is 0(log^(s)). For any 



36 



X G {0, 1}", < f{x) < 2"" < Pn, and hence, f{x) = Cp^{x) by (Sn), and f{x) (mod 2) is the 
last bit in the binary representation of Cp„{x). 

The addition and multiplication in Fp^ can be performed by Boolean circuits of size (logpn)'^^^-' = 
poly(n) and depth O(loglogpn) = 0(log(n)). Hence, using Cp„, we can construct a Boolean 
circuit Cp^^ of poly(s) size and 0(log^(s) log(n)) depth such that the output of Cp^{x), for all 
X G {0, 1}", is the binary representation of f{x). The last bit of this output is f{x) (mod 2). 
Hence, using C^^, we get the required Boolean circuit C2 that just outputs the last bit of C'p^{x). 

(b) The proof is similar to that of (a). The circuits C,C, and Cp^ now have constant depth. 
Since iterated addition, iterated multiplication and division are in TC^ |HABj . the unbounded 
fan-in addition and multiplication gates over Fp^ in Cp^ can be simulated by Boolean constant 
depth threshold circuits of 0(poly(s)) size. This yields a Boolean constant-depth threshold 
circuit C2 of 0(poly(s)) size. Q.E.D. 

Proof of Theorem \6.1[ 

(1) If we take the first assumption, then there exists a Boolean function h{x) = h{xi, . . . , x„) G 
EXP that is not in i.o.nonuniform-NC[n'^, 2*^'] for some constant e > 0. This means h{x) does 
not have a non-uniform circuit of depth n"^ and size 2"^ for all large enough n. If we take the 
second alternative assumption instead, then h is not in i.o. rC°[2"/»] for some constant a > 1. 
This means h does not have a TC^ circuit of size 2"/" for all large enough n. 

Since Xj's here are Boolean variables, we can write h as 

Hx)=^af,n, (23) 

where // ranges over all monomials in Xj's, with degree in each Xi either or 1, and G {0, 1}. 
Define multilinear f{x) = f{xi, . . . , x„) G Z[xi, . . . , x„] by 

/(x) = J]a^//, (24) 

where /i and are exactly as in eq. (|23p . We only think of the variables Xj's as integer variables 
now, instead of Boolean variables. Clearly, f = h, mod 2. Since h G EXP, the expanded form 
(j23p of h{x) can be computed in exponential time. After this, given any input b G Z", f{b) G Z 
can be computed in time that is exponential in n and the bit-length of b. 

Claim 6.3 Assuming GRH, the integral polynomial f cannot be evaluated by an arithmetic 
circuit of size 0(2"' ), e' > a small enough constant, n — )• 00, over any field K of characteristic 
zero, if we take the first assumption. It cannot be evaluated by an arithmetic circuit over K of 
size 2°^'^\ if we take the second assumption. 

Proof of the claim: 

Take the first assumption. Suppose to the contrary that / can be evaluated by an arithmetic 

circuit over K of size 0(2"' ). By Lemma[62](a), h{x) can be computed by a Boolean circuit 

of size 0(2""' ) and depth 0(n^^'), for some constants a, 6 > 0. Choosing e' small enough, this 
contradicts our assumption that h is not in i.o.nonuniform-NC[n^, 2"']. 
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Now take the second assumption. Suppose to the contrary that / can be evaluated by an 
arithmetic circuit over K of size 

0(2''(")). By Theorem (a), / can be evaluated by a depth 
four circuit over K of size 0(2''(")). By Lemma E2](b), h{x) can be computed by a constant- 
depth Boolean threshold circuit of size 2°^"). This contradicts our assumption that h is not in 
i.o.TC^lT^/"]. 

This proves the claim. 

Now (1) follows from this claim and Theorem 15.11 (1). 

If we remove the prefix i.o. from the assumption, then the proof goes through as long as we 
add the prefix i.o. to the conclusion. The changes are straightforward and we omit the details. 

(2) For the first assumption, this follows similarly from Theorem 15.11 (2) and Lemma 16.21 (a). 
For the second assumption, this follows similarly from Theorem 15.11 (4) and Lemma 16.21 (b). 

(3) This follows by relativizing the proofs of (1) and (2). 

(4) By Babai, Fortnow, Nisan, and Wigderson |BFNW| . EXP % MA implies EXP % P/poly 
and hence EXP ^ nonuniform-NC Hence the first statement follows from (2). By Buhrman 
and Homer |BuHj . EXP^^ g n Hf imphes EXP^^ ^ P/poly, and by Impaghazzo, Ka- 
banets, and Wigderson [IKWj . NEXP C P/poly iff NEXP = MA. Hence the second statement 
follows from (3). Q.E.D. 

7 The implications of GRH alone 

We now put all the ingredients in the preceding sections together to prove Theorem 11.11 (d) (2) 
and (3). The following is the full statement of Theorem ll.il (d) (2). 

Theorem 7.1 Suppose GRH holds. 

Let V and G he as in Theorem 13. g|, the case in Theorem \4-l\ being similar. Then the prob- 
lem of derandomizing Noether's normalization lemma for K[V]'-' in a strong form belongs to 
^quasi P rpj^j^g means K[V]'^ has a separating quasi-e.s.o.p., giving the algorithm for construct- 
ing S an access to the NP^^ -oracle. 

For the proof we need the following last ingredient. 

Theorem 7.2 (Kannan, Miltersen, Vinodchandran, Watanabe) (cf. lKn\\MVW^ ) There 
is a language in A3 = ETIME^^ that does not have circuits of size T^l"" for all n, for some 
constant a. 

Proof of Theorem \7.1\ 

Relativizing the proof of Theorem 16.11 (1) with respect to the A^P^^-oracle, it follows that 
iir[y]*^ has a quasi-e.s.o.p., giving the algorithm for constructing S an access to the A^P^^-oracle, 
assuming that (a) GRH holds, and (b) some function in EXP'^^^^ does not have subexponential 
size circuits for all n. The assumption (b) now holds unconditionally by Theorem 17. 2[ Q.E.D. 

This proof also yields: 
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Theorem 7.3 Assume that GRH holds. Let K he an algebraically closed field of characteristic 
zero. Then the black-box derandomization problem for PIT for small degree circuits over K 
belongs to Ag*"^^* . This means the PIT for small degree circuits over K has 0(2P»^2/^»5{^)). 
time computable derandomization with access to the NP^P -oracle. 

The problem also belongs PSPACE unconditionally and to assuming GRH; cf. Theo- 
rem [103 

The following is the full statement of Theorem ll.il (d) (3). 

Theorem 7.4 Let V and G be as in Theorem \3.6l the case in Theorem \4.1\ being similar. Sup- 
pose GRH holds. Then the problem of derandomizing Noether's normalization lemma for K[V]^ 
in a strong form also belongs to i.o-MAquasi-haif-EXP- This means there is an MAquasi-haif-EXP- 
algorithm for constructing a separating quasi-half-exponential s.s.o.p. as defined below that works 
correctly for infinitely many n. 

Here we think of MA as a class of functions rather than decision problems. A quasi-half- 
exponential function is a function of the form e{{e_i/2{n'"'))^), where a and b are constants, and 
ea{n), for a rational a, is the o'th iterate of the exponential function e(n) as defined in |MVW] . 
A function of the form 6^/2 (™'^) is called a half-exponential function. A quasi-half-exponential 
s.s.o.p. is defined like a quasi s.s.o.p. using quasi-half-exponentials instead of quasi-polynomials. 

Proof: The result is proved just like Theorem 17.11 using Theorem 17.51 below in place of Theo- 
rem [72] and Theorem 17.61 below in place of Theorem 12.21 (a). Q.E.D. 

Theorem 7.5 (Miltersen, Vinodchandran, Watanabe) (cf. IMVW^ ) There is a language 
in MAexp tho-t does not have circuits of half- exponential size for infinitely many n. 

Also see a related bound in Santhanam |Saj . If this result can be strengthened to show that 
MAexp does not have sub-exponential size circuits for all n, then it would follow from the proof 
of Theorem 17.41 that the problem of derandomizing Noether's Normalization Lemma for /^[y]'^ 
in a strong form is in AIA assuming GRH. 

Theorem 7.6 // / as in Theorem \2.2\ cannot be evaluated by an arithmetic circuit over K of 
half- exponential size, then PIT for small degree circuits over K has quasi-half- exponential-time- 
computable black-box derandomization. 

The proof is similar to that of Theorem 12.21 

The proof of Theorem 17.41 also yields the following result. 

Theorem 7.7 Suppose GRH holds. Let K be an algebraically closed field of characteristic zero. 
Then the black-box derandomization problem for PIT for small degree circuits over K also belongs 

to i.O.MAquasi-half~EXP- 
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8 The general ring of invariants 



In this section we prove Theorems 11.31 and 11.41 

Let K be an algebraically closed field of characteristic zero. Let y be a polynomial repre- 
sentation of G = SLm{K) of degree < d and dimension n. Since G is reductive |Fuj . V can be 
decomposed as a direct sum of irreducibles: 

V = ®xm{X)Vx{G). (25) 

Here A : Ai > • • • A^. > 0, r < m, is a partition, i.e., a non-increasing sequence of positive integers, 
V\{G) is the irreducible Weyl module |Fuj of G labelled by A, and m(A) is its multiplicity. The 
degree d is the maximum of |A| = Aj for the A's that occur in this decomposition with nonzero 
multiplicity. It is easy to see that d <n (Lemma 18. 17p . We assume that V and G are specified 
succinctly by the tuple 

(F, G) := (n, m; {\\m{\^))- . . . ; (A^ m(A^))) (26) 

that specifies n and m in unary and the multiplicity m{X') (in unary) of each Weyl module 
(G) that occurs in the decomposition (125p with nonzero multiplicity. For each copy of Vx{G) 
that occurs in this decomposition, fix the standard monomial basis of Vx{G) as defined in 
[DEPll ILRj . It will be reviewed in Section [8.31 below. This yields a basis B{V) of V, which we 
call the standard monomial basis of V. Let vi, . . . ,Vn be the coordinates of V in this basis. 

Let C K[V] be the ring of invariants, and V/G := spec{K[V]'-'), the categorical 

quotient |MFK] . 

Theorem 8.1 (Derksen) (cf. Theorem 1.1, Proposition 1.2 and Example 2.1 in W^) The ring 
K\y]^ is generated by homogeneous invariants of degree < I = nrn^dP^ . 

This bound is poly(n) when m is constant, since d <n (Lemma 18. 17p . 

Let i^r[y]p C be the subspace of invariants of degree and the subspace of 

non-constant invariants of degree < /. The spaces and iir[y]<; are defined similarly. The 

dimension t of is bounded by dim(K[y]<;) = Yl,c<i This is exponential in n, 

even when m is constant. This worst case upper bound on t is not tight. But we can not expect 
a significantly better bound since, the singularities of being rational |Btj . the function 

h{l) = d\m.{K\y\f) is, by [F], a quasi-polynomial of degree dimiV/G) > dim{V) — dim(G) = 
n — w? . 

Let F = {/i, . . . , /t} be a set of non-constant homogeneous invariants that form a basis of 
K^\%. By Theorem EH F generates Consider the morphism vry/^j from V to given 

by 

vry/c: v ^ {hiv), . . . , h{v)). (27) 

By Theorem 12.91 the image of this morphism is closed, and V/G can be identified with this 
closed image. Let z = {zi,...,zt) be the coordinates of K*^, I the ideal of V/G under this 
embedding, its coordinate ring. Then K[V/G] = K[z]/I, and we have the comorphism 

TT* : K[V/G] ^ K[V] given by 
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(28) 



Since /j's are homogeneous, K[V/G] is a graded ring, with the grading given by deg(2:i) = 
deg(/j). Furthermore, T^y/Q gives the isomorphism between K[V/G] and ^[V]*^: 

7r*y^a(.K[V/G]) = K{Vf. 

8.1 Construction of an h.s.o.p. 

Grobner basis theory imphes the following result. 

Proposition 8.2 The problem of constructing an h.s.o.p. for K[V]'^ belongs to EXPSPACE. 

The algorithm here needs exponential space and double exponential time even when m is 
constant because, as already observed, the dimension t of the ambient space containing V/ G, 
cf. eq. (j27p . is exponential in n even when m is constant. Furthermore, the time requirement 
of this algorithm remains double exponential even if we only want to construct an S of poly(n) 
size (not necessarily optimal) such that is integral over the subring generated by S. 

If the second fundamental theorem (SFT) akin to that for matrix invariants (Theorem 12. 8p 
holds for (cf. Definition 110. 8p then the problem of constructing an h.s.o.p. can be put 

in REXP^^ assuming GRH; cf. Theorem HOTl 

It follows from Sturmfels [Stm2j (cf. Theorem 4.7.1. therein) and Lemma 12.101 that 
is integral over the subring generated by any homogeneous basis F' = {f [,..., ff} of 
I' = m2((fm+l)™ . We can use F' instead of F in the algorithms of this section. liV = V(rf) (G), 
then n = dim{V) = ('^^"^Y^), and t' is still exponential in n for constant m. So the algorithm 
in Proposition 18.21 will still need exponential space and double exponential time for constant m. 

For the proof of Proposition 18.21 we need the following result. 

Lemma 8.3 Let F = {fi, . . . , ft} be the set of generators of K[V]'^ as in \21l^ . A generating 
set of syzygies among the elements of F can be constructed in work-space exponential in n and 
m and time that is double exponential in n and m. 

Proof: 

Let TTy/G be the morphism from V to based on F as in e.q. (I27p . Using this embedding 
of V/ G and the Grobner basis algorithm in |MR2j , we can compute a generating set of syzygies 
among the elements of F. This algorithm works in space that is exponential in dim.{V/G) < n, 
polynomial in the dimension t of the ambient space, and poly-logarithmic in the maximum 
degree of the elements in F; cf. Theorem 1 in |MR2j . This work-space requirement is clearly 
single exponential in n and m (since d < n). Q.E.D. 

Proof of Proposition \8.2[ 

This is proved just like the first statement of Proposition 13.11 using Lemma 18.31 instead of 
Theorem ESI Q.E.D. 
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For constant m the proof of Proposition 12. 12] can be constructivized (cf. the proof of Propo- 
sition [3?2]) , using F as in eq. (p7|) . to construct a separating S of cardinahty < 2nl = poly(n), 
where / is as in Theorem 18.11 This algorithm also takes space that is exponential in n and time 
that is double exponential. 

8.2 On derandomization of Noether's normalization lemma 

The following result (Theorem 18.51 (2)) says that the double exponential time bound in Propo- 
sition [82] can be brought down to quasi-polynomial for constant m (even for the strong form of 
NNL) requiring \S\ to be 0(poly(n)) (instead of optimal). Before we can state the result, we 
need a few definitions. 

We say that Noether's normalization lemma for is derandomized if there exists an ex- 

plicit system of parameters (e.s.o.p) for as defined below. We say that it is derandomized 

in a strong sense if there exists a separating e.s.o.p. for 

Definition 8.4 (a) We say that S C is an s.s.o.p. (small system of parameters) for 

K[Vf if (1) K[Vf is integral over the subring generated by S, (2) the cardinality of S is 
poly{n,m), (3) every invariant in s is homogeneous of poly{n,m) degree, and (4) every s £ S 
has a small specification in the form of a weakly skew straight-line program JMP^ of poly{n, m) 
bit-length over Q and the coordinates vi, . . . ,Vn ofV in the standard monomial basis. 

(h) We say that a subset S C is an e.s.o.p. (explicit system of parameters) for K[V]^ 

if (1) S is an s.s.o.p. for K\y]^ , and (2) the specification of S, consisting of a weakly skew 
straight-line program as above for each s £ S, can be computed in poly{n, m) time, given n, m, 
and the nonzero multiplicities m{\) 's of Vx{G) 's as in eq. [25\) . 

(c) Quasi-s.s.o.p. and quasi-e.s.o.p. are defined similarly by replacing the poly{n,m) bounds by 
Q|-2po/j//o(7(n,m)^ IjQy^jidg^ Pq^ q constant m, subexponential- s.s.o.p. and subexponential- e.s.o.p. 
with exponent (5 > are defined by replacing the poly{n,m) bounds by 0(2*^^" ^) bounds. 

(d) S.s.o.p., e.s.o.p., and the related notions are defined in a relaxed sense by dropping the weakly 
skew requirement in (a) (4) and the degree requirement in (a) (3). 

Here (4) in (a) is equivalent |MPj to (4)': every s £ S can be expressed as the determinant of 
a matrix of poly(n, m) size whose entries are (possibly non-homogeneous) linear combinations of 
vi, . . . ,Vn with rational coefficients of poly(n, m) bit length. By |Cs| . it follows that, given such 
a weakly-skew straight-line program of an invariant s £ S and the coordinates of any rational 
point V £ V, the value s{v) can be computed in time that is polynomial in n, m, and the total 
bit-length {v) of the specification of v, and even fast in parallel, by a uniform AC^ circuit of 
poly(n,m, (v)) bit-size with oracle access to DET. 

The following is the full statement of Theorem 11.31 
Theorem 8.5 

(1) Suppose the black-box derandomization hypothesis for PIT for diagonal depth three circuits 
over K holds (cf. Section \2.1\) . Then K[V]^ has a separating e.s.o.p., if m is constant. 
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Specifically, there is exists a set S CI K[V] of poly{N) invariants, N = n*" dJ^ , such that 

(1) S is separating, and hence (cf. Theorem \2.11\) K\VY' is integral over its suhring generated by 
S, (2) every invariant in S is homogeneous of poly{N) degree, (3) every s £ S has a weakly skew 
straight-line program over Q and vi,. . . ,f„, of poly{N) hit-length, and (4) the specification of S, 
consisting of such a weakly-skew straight-line program of every invariant in S, can be computed 
in poly{N) time. 

Assuming the parallel black-box derandomization hypothesis for PIT for diagonal depth three 
circuits ( cf. Section \2. the specification of S can be computed by a uniform AC^ circuit of 
poly{N) bit-size with oracle access to DET . 

(2) Suppose m is constant, or more generally, 0{polylog{n)) (as is the case if m = 0{^/d); cf. 
Lemma \8.17\ ). Then K[V]^ has a separating quasi-e.s.o.p (unconditionally). Furthermore, such 
a separating quasi-e.s.o.p. can be constructed by a uniform AC^ circuit of quasi-poly{n) hit-size 
with oracle access to DET. 

Theorem 11.31 follows from (2). 

To prove Theorem 18.51 we first recall some results from standard monomial theory [LR^ 
IDEPlt IDRSj and prove some lemmas. In what follows, we let N = n™ as above. 

8.3 The standard monomial basis of V 

We now define the standard monomial basis of V mentioned in Definition 18.41 following [DEPl^ 
ILRj and prove some lemmas concerning its complexity-theoretic properties. 

Let G = GLm{K). Let Z he an m x m variable matrix. Let K[Z] be the ring generated 
by the variable entries of Z. Let K[Z]ii denote the degree d part of K[Z]. It has commuting 
left and right actions of G, where (cr, a') £ G x G maps h{Z) £ K[Z]d to h{a^Za'). For each 
partition A : Ai > • • • Ag > 0, g < m, the Weyl module V\{G) labelled by A can be embedded in 
K[Z]d, d=\\\=Yji ^i, as follows. 

Let (A, B) be a bi-tableau of shape A. This means both A and B are Young tableau |Fuj 
of shape A such that (1) each box of A or B contains a number in [m] = {1, . . . ,m}, (2) all 
columns of A and B are strictly increasing, and (2) all rows are non-decreasing. Let Ai and Bi 
denote the i-th column of A and B respectively. With any pair {Ai, Bi) of columns, we associate 
the minor Z{Ai, Bi) of Z indexed by the row numbers occurring in Ai and the column numbers 
occurring in Bi. With each bi-tableau {A,B) we associate the monomial in the minors of Z 
defined by Z{A, B) := Z{Ai, Bi)Z{A2, B2)Z{A3, B^) ■ ■ ■ . We call such a monomial standard of 
shape A and degree d = |A|. It is known |DRS| that the standard monomials of degree d form a 
basis of K[Z](i. We call it the DRS-basis of K[Z]d and denote it by B{Z)d. 

A standard monomial Z(A,B) is called canonical if the column Bi, for each i, just consists 
of the entries 1, 2, 3, . . . in the increasing order. It is easy to see that, for each partition A, the 
subspace of K[Z] spanned by the canonical monomials of shape A is a representation of G under 
its left action on K[Z]. It is known [DEPH ILRj that this representation is isomorphic to the 
Weyl module Vx(G) of G, and that the set of canonical monomials of shape A form its basis. 
We refer to it as the standard monomial basis of V\{G), and denote it by Bx = B\[G). Each 
Weyl module V\{G) of G = SLmiK) is also a Weyl module of G in a natural way. Hence this 
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also specifies the standard monomial basis Bx of Vx{G). 

Fix the standard monomial basis Bx in each copy of V\(G) in the complete decomposition 
of V as in (j25p . This yields a basis B{V) of V, which we call its standard monomial basis. It 
depends on the choice of the decomposition of V (if the multiplicities are greater than one). But 
this choice does not matter in what follows. 

Lemma 8.6 (a) Given any nonstandard monomial n of degree d in the minors of Z, the rep- 
resentation of fi in the DRS basis B{Z)d can be computed in poly{d"^ ) time. More strongly, it 
can be computed by a uniform AC^ circuit of poly {d"^ ) bit-size with oracle access to DET. 

(b) Consider K[Z]d as a left G-module, where g € G maps h{Z) to (g • h){Z) = h{g^ Z). Then, 
given the specifying label (a bi-tableau) of any basis element b G BiZ)d and g G GLmiQ), the 
representation of g-b in the DRS-basis B{Z)d can be computed in poly{d^ , {g)) time, where {g) 
denotes the bit-length of the specification of g. More strongly, it can be computed by a uniform 
AG^ circuit of poly{d^ , (g)) size with oracle access to DET. 

(c) Let Vx{G) be a Weyl module of degree d, and Bx its standard monomial basis as described 
above. For any basis element b G Bx, and g G GLmiQ), the representation of g ■ b in the basis 
Bx can be computed in polyid^ , (g)) time. More strongly, it can be computed by a uniform AC^ 
circuit of poly{d^ , {g)) bit-size with oracle access to DET. 

When m is constant, the poly(d'" ) bound becomes poly(d) = poly(n). 
Proof: 

(a) Let B'{Z)d denote the usual monomial basis of K[Z]d consisting of the monomials in the 
entries Zij of Z of total degree d. The cardinality of B'[Z)(i is equal to the number of monomials 
of degree d in the m? variables Zij. This number is C^^T.^^) = poly(d™'^). The cardinality of 
B{Z)(i is the same. Let Ad be the matrix for the change of basis so that: 

B{Z)d = AdB\Z)d, and B' {Z)d = A-/B{Z)d. (29) 

2 

The matrix Ad can be computed in poly((i"^ ) time. For this, observe that each row of 
Ad corresponds to the expansion of a standard monomial b G B{Z)d in the usual monomial 
basis B'{Z)d. Since the number of monomials of degree < d in the m? variable entries of Z is 
poly(d™ ) and the degree of b is d, this expansion can be computed by a uniform weakly skew 
|MPj straight-line program ofpoly((i'" ) bit -size (constructed by induction on d). It follows |MPj 
that it can also be computed fast in parallel by a uniform AG^ circuit of po\y{d^ ) bit-size with 
oracle access to DET. This yields the representation of b in the basis B'[Z)d. Thus Ad can be 
computed by a uniform AG^ circuit of poly((i™ ) bit-size with oracle access to DET. 

Once Ad has been computed, A'^^ can also be computed fast in parallel by a uniform AG^ 

2 

circuit of poly((i™ ) bit-size with oracle access to DET. 

The standard representation in the basis B[Z)d of any nonstandard monomial ^ G K[Z]d 
in the minors of Z can now be computed fast in parallel as follows. Let 6(//) and 6'(/u) be 
the row vectors of the coefficients of the representations of /U in the bases B[Z)d and B'{Z)d, 
respectively. Clearly 6(/i) = b'{fi)A^^. Expand /i fast in parallel (as we expanded b above) to 
get its representation b'{fi). Multiply this on the right by A'^^ fast in parallel to get b{fi). 
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(b) First we expand g-b fast in parallel (as above) to get its representation in the usual monomial 
basis B'[Z)fi- The representation in B[Z)d can now be computed fast in parallel by multiplica- 
tion on the right by A'^^ . 

(c) This follows from (b) using the concrete realization of Vx{G) described before Lemma 18.61 as 
the G-submodule of K[Z]d, d = |A|, spanned by the canonical standard monomials of shape A. 
Q.E.D. 

We also note the following for future reference. 

Lemma 8.7 (a) Given any standard monomial b G B{Z)fi and a generic (variable) u ^ G, 
the representation of u ■ b in B{Z)d can be computed in poly{d^ ) time. More strongly, it can 
be computed by a uniform AG^ circuit of poly{d"^ ) bit-size with oracle access to DET. The 
coefficients of this representation are polynomials in the entries of u of degree d. 

(b) Given any basis element b G Bx and a generic u £ G, the representation of u ■ b in the basis 
B\ can be computed in poly{d"^ ) time. More strongly, it can be computed by a uniform AG^ 
circuit of poly{d^ ) bit-size with oracle access to DET. The coefficients of this representation 
are polynomials in the entries of u of degree d= |A|. 



The proof is similar to that of Lemma 18. 6[ 



8.4 Proof of Theorem 18.51 assuming explicitness of V/G 

Let V = (ui, . . . ,Vn) be the coordinates of V in the standard monomial basis B(y) of V . Let 
X = . . . , Xn) be new variables. Let 

X = Y,XiVieK[V-x], (30) 

i 

be a generic affine combination of ViS. Here i^[y;x] denotes the ring obtained by adjoining 
xi, . . . ,Xn to K[V] = K[vi, . . . ,Vn]. Then, for any < c < /, £ K[V; x] is a generic linear 
combination of all monomials in tij's of total degree c: 

Here ( ) denotes the multinomial coefficient, and the monomials (rf,-^i vf^) occurring in 
this expression form a basis of the subspace ^i^[l^]c ^ ^[^] of polynomials on V of degree c. 

Let R = Rg : K[V] K\V]'^ denote the Reynolds' operator for G (cf. Section 2.2.1 in 
[DKj ) . We denote the induced map from ^[y;;);] to by R as well. Here -fC[y]'^[x] 

denotes the ring obtained by adjoining xi, . . . ,x„ to Now consider a generic invariant 

R{X^){v,x)= (a \ )^(n<')(n<')e^[^]''N- (32) 

Since the monomials (HiM ^f') (|31l) form a basis of ^[yjc, it follows from the properties 
of the Reynold's operator that the elements ^(ni>i^r') ^ occurring in (j32p span the 
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subspace C of invariants of degree c. By Theorem 18.11 the invariants of degree 

< I generate -fC[y]'^. Hence the set 

F = {i?(n<0 I 5^ai = c,0 < c < /}. (33) 

i>l i 

generates 

Let A3[n,l,k] denote the class of diagonal depth three circuits (cf. Section [2.ip over K and 
the variables xi, . . . ,Xn with total degree < / and top fan- in < k. The size of any such circuit is 
0{knl). 

The following result is the key to the proof of Theorem 18.51 

Theorem 8.8 Let N = n"^^d"^* , and let I = nrrPd"^"^^ as in Theorem \ 8.1l Given n,m, < 
c <l, and the specification {V,G) of V and G as in eg. Ii26\) . one can compute in poly{N) time 
the specification of an arithmetic constant depth circuit C = m, c] over Q such that (1) C 
computes the polynomial R{X'^){v,x) in x = {xi, . . . ,Xn) and v = {vi^ . . . ,Vn), and (2) for any 
fixed h £ V , the circuit Ch obtained by specializing the variables Vi 's in C to the coordinates 
of h in the standard monomial basis B(y) of V is a diagonal depth three circuit in the class 
A3[n,c,k], with k = poly{N). 

More strongly, C can be computed by a uniform AC^ circuit of poly{N) bit-size with oracle 
access to DET . 

This result says that V/G for constant m is an explicit variety in the following sense. 

Definition 8.9 We say that V/G = spec{K\y]'-') is explicit if, given n,m, and the specification 
(y, G) of V and G as in eq. ^26\) . one can compute in poly{n, m) time a set of arithmetic circuits 
C = G\y,m,c\'s, 1 < c < g = poly{n,m), over Q of poly{n,m) bit size over the variables 
X = (xi, . . . , Xn) and v = (f i, . . . , Vn) such that the polynomials C[V, m, c](x, v) 's computed by 
C[V,m,c] 's are of poly(n,m) degree and can be expressed in the form 

G[V,m,c]{x,v) = ^/j-c(x)5j,c(w), 
j 

with homogeneous fj^c's and gj,c's, so that K[V]^ is generated by gj^c{v)'s and fj^c{x)'s are 
linearly independent. 

We say that V/G is explicit in a relaxed sense if the degree requirement on C\y,m,c\{x,v) 's 
is dropped. 

We say that V/G is strongly explicit if the circuits G[V, m, c] 's are weakly skew of poly (n, m) 
degree. 

This definition is a special case of a general definition of an explicit variety that we shall 
formulate later; cf. Definition 110.21 

By Theorem 18.81 V/G is strongly explicit if m is constant, and quasi-strongly-explicit if 
m = polylog(n). By Lemma 13.71 V/G in Theorem 13.61 is strongly explicit. More generally: 
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Conjecture 8.10 The categorical quotient V/G for V and G as in eq. i25\) is explicit in a 
relaxed sense for any m. 

Before proving Theorem 18.81 let us prove Theorem 18.51 using it. 

Proof of Theorem 18.51 (1) assuming Theorem 18.81 

Let N, k = poly(A''), and I be as in Theorem 18. 8[ Consider the class A^ln, I, 2k] of diagonal 
depth three circuits. 

By our black-box derandomization assumption for diagonal depth three circuits over K, 
there exists a hitting set T against A3[n, I, 2k] that can be computed in poly(n, k, I) = poly(A^) 
time. Assuming the parallel black-box derandomization hypothesis, T can be computed by a 
uniform AC^ circuit of poly(A^) bit-size with oracle access to DET. 

Fix such a T. By the definition of the hitting set, for any circuit D G A3[n, I, 2k] such that 
D{x) is not an identically zero polynomial, there exists b £ T such that D{b) ^ 0. 

For any b £ T and < c < I, define the invariant 

rb,e :=RiX^){v,b)eK[V]'^, 

and let 

5 = {rb,c \beT,0<c<l}C K[V]^. (34) 

The elements of 5 are homogeneous polynomials in v of degree < /, which is poly(n) if m is 
constant. 

Claim 8.11 The set S is separating. 

Proof: Let wi, . . . , Wn be auxiliary variables. For every c < I, define the symbolic difference 

R^{x, V, w) = R{X%v, x) - R{X^){w, x), 

where R{X'^){'w, x) is defined just like R{X'^){v, x) substituting w for v. Suppose e, f £ V are 
two points such that r(e) r(/) for some r E It follows that some generator in the set 

F in eq. (j33|) assumes different values at e and /. From eq. ([32]) . it follows that, for some c < /, 
R'^{x,e,f) is not an identically zero polynomial in x. By Theorem 18. 8|. R{X'^){e,x) is computed 
by a diagonal depth three circuit in the class A3[n,l,k]. Hence R^{x,e,f) is computed by a 
diagonal depth three circuit in the class A3[?i, /, 2k]. This means, for some b £ T, R^{b, e, /) ^ 0. 
That is, 

rbAe) = R{X'){e,b) / R{X^){f,b) = r.^)- 
This implies that S is separating. This proves the claim. 

It follows from the claim and Theorem 12.11 1 that -fCfy]'^ is integral over the subring generated 
by S. 

For any b £ T and < c < let Di,(. be the circuit obtained by specializing the circuit 
G[V, m, c] in Theorem 18.81 at x = b. Then Db^c computes r^^c = R{X^){v, b) as a polynomial in v. 
We specify S by giving, for every invariant r^^c £ the specification of D^^c- By Theorem 18. 8| 
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the circuit D^^c has constant depth and poly(A^) bit-size. Hence, it can also be specified by a 
weakly skew straight-line program of poly(A^) bit size. 

By our black-box derandomization hypothesis, the specification of T can be computed in 
poly(A^) time. Once T is computed, using Theorem 18.81 we can compute in poly(A^) time, for 
each b G T and c < I, the specification of the circuit Db c computing the invariant r^.c G S. Thus 
the specification of S in the form of a circuit Dh^c for each rb^c, or the corresponding weakly skew 
straight-line program, can be computed in poly(A^) time. Hence, S" is a separating e.s.o.p. 

Assuming the parallel black-box derandomization hypothesis, T, and hence S, can be com- 
puted by a uniform AC^ circuit of poly(A^) bit-size with oracle access to DET. 

This completes the proof of theorem 18.51 (1) assuming Theorem 18. 8[ 

Proof of Theorem 18.51 (2) assuming Theorem 18.81 

By Theorem 12.41 the black-box derandomization hypothesis holds for diagonal depth three 
circuits allowing a quasi-prefix. Hence, Theorem [83] (2) follows from (the proof of) Theorem 18.51 
(1) inserting quasi-prefixes in appropriate places. 

This completes the proof of Theorem 18.51 assuming Theorem 18.81 
8.5 Explicitness of V/G for constant m 

It remains to prove Theorem 18.81 Towards that end, we first prove some auxiliary lemmas. 

First we turn to the computation of the Reynolds operator R = Rq, G = SLm{K). Consider 
the representation morphism ip : V x G ^ V given by: {v,a) — )• a~^v. Let ip* : K[V] — t- 
K[V] ® K[G] denote the corresponding comorphism. Given / G -f^l^], let tp*{f) = ^iQi ® hi, 
where Qi G K\y] and hi G K[G]. 

Lemma 8.12 (cf. Proposition 4-5.9 in JHSi) 



This reduces the computation of Rg on K\V] to the computation of Rq on Since 
G, as an affine variety, has just one G-orbit, Rq maps K[G] to = K. Let Z be an 

m X m variable matrix. Computation of Rg on K[G] can be reduced to the computation of 
Rg on K[Z]. This is because K[G] = K[Z]/J, where J is the principal ideal generated by 
det(Z) — 1. li g G K[G] is represented by / G K[Z] then RG{g) = Rcif) + J- Furthermore, 
K[Zf = K[det{Z),det{Z)-^. 

The computation of Rg on K[Z] can be done using Cayley's O process as in |H12] . Here Q 
is a differential operator on K[Z] defined as follows. For any h{Z) G K[Z], 



where Sm is the symmetric group on m letters and vr ranges over all permutations of m letters. 



RG{f) = Y.9iRG{hi). 
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Lemma 8.13 (cf. Proposition 4-5.27 in JDK^ ) Suppose f G K[Z] is homogeneous. If the degree 
of f is mr then 

RG{f/det{Zr)=det{ZY-P^, 

where Cr,m = (det(Z)'^') G Z. // the degree of f is not divisible by m, then Rcif) = 0. 

If g ^ K[G] is represented by f ^ K[Z], then Rcid) = if the degree of f is divisible by 
m, and Rcig) = otherwise. 

Write det{Zy' = '^^aaa{zi^i, . . . , Zm,m), where a ranges over the monomials in -Zjj's of 
degree mr, and aa & Then 



' dzi 1 dz; 



The number of a's here is ["^^^^-^ ^) = poly(deg(/)™'^), when deg(/) = mr, and the bit- 
length of each Ua is poly(m,r) = poly(deg(/)). Hence € Q, for / G Q[Z] of degree mr, 

2 

can be computed in poly(deg(/)'" , (/)) time, where (/) denotes the total bit-length of the 
coefficients of /. This can also be done fast in parallel. Thus: 

Corollary 8.14 Given g G Q[G] C K[G] represented as a polynomial f G Q[Z, det(Z)~^], 
^cid) £ Q can be computed in |3o/y(deg(/)'" , (/)) time. More strongly, this can be done by a 
uniform AG^ circuit of poly{deg{f)"^ Af)) bit-size. 

Now let G = GLm{K). Then V as in (j25p is also a polynomial G-representation in a natural 
way so that, as a G-module: 

V = ®xmWVx{G). (35) 

— 2 2 4 

Let ti G G be a generic (variable) matrix. Let < c < / = poly(?i, ) and N = n™ be 
as in Theorem 18.81 For X is as in (130p . u ■ X can be expressed as: 

u ■ X = '^Xi{u ■ Vi) = '^ei{x,u)vi, (36) 

i i 

where Cj G Q[x, u] is a polynomial in Xj's and the entries of u that is linear in xj^s and has total 
degree < d in the entries of u, since y is a representation of G of degree < d. Hence, 

u-X' = {u-Xy = Y,l^M^,x), (37) 
where fi ranges over the monomials in the entries of u of total degree at most dc < dl = 

2 

poly(n,(i"* ) and j3^{v,x) is a polynomial of degree c m. v = {vi, . . . ,Vn) as well as x = 
(xi, . . . , Xn). The number of /i's here is < = poly(A^). 

2 4 

Lemma 8.15 Given n,m,d,c as above, N = d^ , and the specification {V,G) ofV and G 
as in /i26]). one can compute in poly{N) time, and more strongly, by a uniform AC^ circuit of 
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poly{N) bit-size with oracle access to DET, the specification of a circuit C' overQ of poly{N) hit- 
size on the input variables vi, . . . ,Vn and xi, . . . ,Xn and with multiple outputs that compute the 
polynomials (3^{v,x) 's in P^p . The top (output) gates of C are all addition gates. Furthermore, 
for any fixed h £ V , the circuit C'^ obtained from C by specializing the variables Vi 's to the 
coordinates of h (in the standard monomial basis of V ) is a diagonal depth three circuit with 
multiple outputs in the class A3[n,c, e], e = poly{N). By this, we mean that the sub-circuit of 
C below each output gate is in lS.^[n,c,e\. 

Proof: We cannot compute Pf^{v,x) in eq. (j37p by expanding (u • Xy as a polynomial in x, u, 
and V, since the number of terms in this expansion is exponential in n. But we can compute it 
by a constant depth circuit by evaluating (n • Xy at several values of u and then performing 
multivariate Van-der-Monde interpolation in the spirit of [Strlj as follows. 

First we show how to construct, for any fixed g G GLmiQ), a constant depth circuit Ag that 
computes the polynomial in v and x given by 

g-X' = {g.Xr = C£e,{x,g)v^r, (38) 

i 

where ei{x,g) is a linear form in x that is obtained by evaluating ei{x,u) in eq. p6p at u = g. 
Towards this end, we first construct a depth two circuit A'g with addition gate at the top that 
computes the quadratic polynomial in v and x 

g ■ X = ^ei{x,g)vi (39) 

i 

obtained by instantiating (]36p at u = g. Recall that vi, . . . ,Vn are the coordinates of V corre- 
sponding to the standard monomial basis B{V) of V compatible with the decomposition (|35p . 
Hence, using Lemma 18.61 (c), the coefficients of the linear form ei{x,g^, for given g G GL^(^^^ ^ 
can be computed in poly(n,(i"^ Ad)) ti™e, and more strongly, by a uniform AC^ circuit of 
poly(n,d"^ ,{g)) bit-size with oracle access to DET. After this, the specification of A'g can 
also be computed in poly(n, , ((7)) time, and more strongly, by a uniform AC^ circuit of 
poly(?i, d'" , (g)) bit-size with oracle access to DET. 

Next we construct Ag with a single multiplication gate of fan-in c at its top that computes 
the c-th power g ■ X computed by the output node of A'g. The polynomial Ag{v, x) computed 
by Ag is {g ■ X'^){v,x). Furthermore, for any fixed h £V, the circuit obtained by instantiating 
Ag at V = h is a depth two circuit with multiplication (powering) gate at the top. 

Next we show how to efficiently construct a circuit C for computing the polynomials /3^'s 
using Ag's for several g^s of poly(A^) bit-length. 

Let e be the number of monomials /i's in Uij with the degree in each Uij at most d' = 

2 2 

do. Then e = 0{{dc)"^ ) = poly(A^), since c < I = poly(n, ). Order these monomials 
lexicographically. For r < e, let fir denote the r-th monomial in this order. Choose m x m 
non-negative integer matrices gi,--- ,ge such that (1) the ex e matrix B = [fir{gs)], whose 
(s,r)-th entry, for s,r < e, is fir{gs), is non-singular, and (2) every entry of each g^ is < d' . We 
can choose such gg^s explicitly so that i? is a multivariate Van-der-Monde matrix as described 
in Section 3.9 in |MPj . Specifically, let E = [d'Y"" be the set of e integral points in , where 
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[d!] = {0, . . . Order E lexicographically. Let gs be the s-th point in E interpreted as an 

m X m matrix. Then i? is a non-singular multivariate Van-der-Monde matrix (cf. Sections 
3.9 and 3.11 in [MPj ) . It can be computed in poly(A^) time, and more strongly, by a uniform 
AC^ circuit of poly(A^) bit -size. Its inverse B ^ can be computed by a uniform AC^ circuit of 
poly(iV) bit-size with oracle access to DET. 

Let /3 denote the column- vector of length e whose r-th entry, for r < e, is fi^^{v,x). Let A 
denote the column vector of length e whose s-the entry, for s < e, is Ag^{v,x) = (qs ■ X^){y,x). 
Then, by dST]), 

A = BI3 and /3 = B-^A. 

Using the second equation, we can construct a constant depth circuit C (with multiple outputs) 
for computing the entries of /3 using the constant depth circuits Ag^ 's constructed above. Each 
output gate of C is an addition gate with fan-in e = poly(A^). Each gate at the second level 
from the top is the c-th powering gate, because the top gate of each Ag^ is the c-th powering 
gate. For a fixed h £ V, the circuit C'^ obtained by instantiating C at v = h is thus a diagonal 
depth three circuit with multiple outputs in the class A3 [n,c, e]. 

Since Ag^, for every Qs £ E, and B~^ can be constructed in poly(A^) time, the construction of 
C takes poly{N) time. More strongly, it can be computed by a uniform AC^ circuit of poly(A^) 
bit-size with oracle access to DET. Q.E.D. 

Now we turn to the construction of the circuit C = C[V,m,c] for computing R^X'^), as 
required in Theorem 18.81 given n, d, m, c and the specification {V, G) of V and G as in (|26|) . 

Let Uij denote the (i,j)-th entry of the generic u £ G, and u~j, the (i,j)-th entry of 
= Adj{u)/ det(ti). Substituting for u in eq. ljST]) . we get 

u-^ ■ X' = (n-i • Xy = Y,,Jp^{v,x), (40) 

where /i ranges as in eq. ()37p . and /u' G Q[n, det(n)~"^] is a polynomial in the entries of u and 
det(n)~^ obtained from by substituting u^^ for Ujj. The degree of the numerator of /x' is 

again poly(n, ). 

By Lemma [8712] and eq. ([40]l . 

R{X^){v,x) = Y,RG{^^')^^.{^,^)■ 

Here Rg{^J^') is a rational number that can be computed in poly(A^) time using Corollary 18. 14^ 
since the degree of /i' is poly(n, d^ ). Let C be the circuit for computing /3^'s as in Lemma[8T5l 
The circuit C is obtained by adding a single addition gate that performs linear combinations of 
the various output nodes of C computing /3^'s, the coefficients in the linear combination being 
the poly(A^)-time-computable rational numbers RcifJ-')- Since the top gates of C are addition 
gates with fan-in e, we can ensure, by merging the addition gates in the top two levels, that the 
depth of C is the same as that of C . The top gate of C after this merging is an addition gate 
with fan-in k = = poly(A^). 

Given n,d,m,c, and {V,G), the specification of C can be computed in poly(A^) time by 
Lemma 18.151 After this, the specification of the circuit C as above can also be computed in 
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poly(A^) time. More strongly, it can be computed by a uniform AC^ circuit of poly(A^) bit-size 
with oracle access to DET. 

For any fixed /i G y, the circuit Ch obtained by specializing the variables Wj's in C to the 
coordinates of /i is a diagonal depth three circuit in the class A3[n, c, /c], with k = = poly(A^). 
This is because, by Lemma 18.151 is a diagonal depth three circuit with multiple outputs in 
the class A3 [n,c,e], e = poly(A^). 

This completes the proof of Theorem 18.81 Q.E.D. 

With this, we have also completed the proof of Theorem 18.51 

We also note down the following consequence of the proof. 

Lemma 8.16 Let V and G be as in i25\) . A set of generators of K[V]'^ can be constructed by 
a uniform AC^ circuit of poly {d"^'^^^^"') bit-size with oracle access to DET. 

Proof: By Theorem 18. H i^r[y]*^ is generated by i?G(/i)'s where /i ranges over all monomials of 
degree < ci"*°'^' in the coordinates of the standard basis of V. The number of such monomials 
is 0(d'"°'^'"). Using Lemmas 18.71 18.121 and Corollary 18.141 all these Rci^J-Ys can be computed 
by a uniform AC^ circuit of poly((i"^°'^'") bit-size with oracle access to DET. Q.E.D. 

The following elementary fact was mentioned in Theorem 18. 5^ and (a) below has been used 
implicitly throughout this section. 

Lemma 8.17 Let V be as in Then (a) dim{V) = n > d, and (b) n = 0(2^(™)), if 

d = VL{m^). 

Hence I and N (as in Theorems \8. 1\ and \8. 5\) are 0{poly{n)), ifm is constant, andO{2PO^y^09in'>), 
ifm = 0{Vd). 

Proof: It suffices to prove this when V = Vx{G), A : Ai > • • • > 0, r < m. Note that Am = 0. 
If (i = |A| = rj(m^), Xi > m' = m/a, for some constant a. 

The dimension of Vx{G) is equal to the number of semi-standard tableau of shape A with 
entries in [m] = { 1 , . . . , m} |Fuj . By semi-standard we mean the columns are increasing and the 
rows are non-decreasing. Let A denote the Young diagram of shape A. 

(a) It suffices to construct d distinct semi-standard tableau of shape A with entries in [m]. 

For each location {i,j) in A, let Tjj be the semi-standard tableau whose k-th column, for 
k < j, consists of the entries 1,2,3,4,... in the increasing order, and for k > j, consists of 
the entries 1, 2, . . . , i — 1, i -|- 1, i -|- 2, . . . in the increasing order. Clearly Tj^'s are distinct and 
semi-standard, and there are d of them. 

(b) It suffices to construct ^(2™'/^) distinct semi-standard tableau of shape A with entries in 
H = {1, • • • ,m}. 

First construct a basic semi-standard tableau T of shape A by filling the boxes of A as 
follows. The first column of T contains the entries 1,2,3, .. . in the increasing order. Suppose 
1 < j < m' < Ai and the (j — l)-st column of T has been constructed. Suppose the length of 
the j-th column of A is q. We construct the j-th column of T as follows. We copy the first q 
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entries of the constructed {j — l)-st column in the j'-th column of A. Then we read this j'-th 
column from bottom to top and increase the first encountered entry less than m that can be 
increased without violating semi-standardness. This is always possible if j < m' < Xi. The 
entries in the columns of A numbered m' + 1 to Ai are filled in any way so as to ensure that T 
is semi-standard. 

Now suppose m' is even, the odd case being similar. Given any a : [l,m'/2] — )■ {0, 1}, we 
construct a semi-standard tableau T„ using T as follows. The columns of T„ with numbers m' + l 
onwards are the same as those of T. The odd numbered columns of Tq- with numbers less than 
m' are also the same as those of T. The even numbered columns of T„ with numbers at most 
m' are constructed as follows. For any j G [l,m'/2], if a{j) = 0, the 2j-th column of T„ is the 
same as the 2j-th column of T. If a{j) = 1, the 2j-th column of To- is obtained by just copying 
the first q entries of the (2j — l)-st column of T, where q is the length of the 2j-th column of 
T. It is easy to see that To-'s thus constructed are all semi-standard and distinct. Furthermore, 
there are 2"^'/2 of them. Q.E.D. 

8.6 General m 

The following is the full statement of Theorem II. 4i 

Theorem 8.18 Suppose K is an algebraically closed field of characteristic zero. Let V as in 
^ he a rational representation of G = SLm{K) of dimension n. Suppose V/G is explicit in a 
relaxed sense (cf. Conjecture \8.10\l . 

(a) Suppose the black-box derandomization hypothesis for PIT over K holds. Then K[V]'^ has 
a separating e.s.o.p. in a relaxed sense (cf. Definition \8.4\ )- 

(b) Suppose PIT for circuits over K of size < s has O {2^^) -time- computable hitting set for any 
small constant e > 0. Then K\y]'^ has a separating subexponential- e.s.o.p. in a relaxed sense 
for any exponent 6 > 0. 

(c) A separating s.s.o.p. in a relaxed sense exists for K\y]^ and can be constructed in work- 
space that is polynomial in n and m (unconditionally). Assuming GRH, it can be constructed 
by a Tj^- algorithm. 

(d) Analogues of Theorem and Theorem \l.l\ (d) (2) and (3) also hold in this setting if V/G 
is explicit (with degree restriction as in Definition \8.9\l . 

Proof: 

(a) : The proof is similar to that of Theorem 18.51 (1), using the assumed explicitness of V/G 
in place of Theorem 18.81 and the black-box derandomization hypothesis for PIT in place of the 
black-box derandomization hypothesis for diagonal depth three circuits. 

(b) : The proof of (a) can be modified in a straightforward manner. 

(c) : Suppose y/G is explicit in a relaxed sense (cf. Definition [82]). Given any circuit C[y, m, c](x, 
as in Definition 18.91 Ist C[y, m, c](x, f') denote the circuit obtained from C[y, m,c] by replac- 
ing V = {vi, . . . ,Vn) by another tuple v' = {v[, . . . ,v'^) of variables. Let C[V,m,c\{x,v,v') = 
C[V,m,c]{x,v) — G[V,m,c\{x,v'). Let T be the hitting set provided by Theorem 12.11 against 
the class of circuits obtained by letting the v and v' variables of the circuits C[V,m,cys range 
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over the coordinates of the points in V'^ = V x V (with v being of the coordinates of the first 
copy of V, and v' of the second). We use this new T in place of the old T in the proof of (a). 
Let 5 C Er[y]'^ be the set of invariants as in eq. ([M|) using this new T instead. Since we do not 
have any efficient algorithm to construct the new T, we do not have any efficient algorithm to 
construct the new S either. Hence the new S will only be a separating s.s.o.p. (in a relaxed 
sense) instead of a separating e.s.o.p. (in a relaxed sense). It can be constructed in polynomial 
work space (or by a Ss-algorithm assuming GRH) by the proof technique of Theorem 13.111 using 
the assumed explicitness of V/ G in place of Lemma 13. 7|, the key to the proof of Theorem 13.111 

(d) If V/G is explicit (with degree restriction), one only needs the black-box derandomization 
hypothesis for PIT with small degree in (a). The proof of (a) in this case can be extended just as 
the proof of Theorem 13.61 was extended to prove Theorem 11.21 and Theorem ll.il (d) (2) and (3). 
Degree restriction is needed since Theorem 12.21 (a) only holds for PIT of small degree. Q.E.D. 

9 Generalizations 

In this section we briefly state generalizations of the preceding results to any classical simple 
algebraic group instead of the special linear group. 

Let G C SLm (with the standard embedding) be a classical simple algebraic group over an 
algebraically closed field K of characteristic zero and V its rational representation of dimension 
n. Let denote the ring of invariants. Since G is reductive, V decomposes as 

V = (Bxm{\)Vx{G), (41) 

where A ranges over the highest weights of G, Vx{G) denotes the irreducible Weyl module |Fuj 
of G labelled by A, and m(A) its multiplicity. We assume that V and G are specified succinctly 
by the tuple 

{V,G) := (n,m; {X\m{X')y, . . . ; (A^m(A''))) (42) 

that specifies n and m (in unary), and the multiplicity m{\^) (in unary) of each Weyl module 
Vxi (G) that occurs in the decomposition (j^Tj) with nonzero multiplicity. For each copy of Vx{G) 
that occurs in this decomposition, fix the monomial basis Bx of Vx{G) as defined in |RSj . We 
refer to it as the RS-basis of Vx (G) . (We could also have used the standard monomial basis |LRj 
here. But this would make the calculations below a bit more involved.) This yields a basis B{V) 
of V, which we call its RS-basis. The elements of Bx are indexed by LS (Lakshmibai-Seshadri)- 
paths |LRt IRSj instead of tableau now. Let e^, fi and /cj's denote the standard generators of the 
Lie algebra Q of G. Let vx denote the highest weight vector of Vx{G). With every LS-path r] 
(dominated by the highest weight A) , the article |RS] associates a monomial fj,r^ in the generators 
/i's such that Hr]{vx) is the element of Bx indexed by r]. 

We can define an e.s.o.p. and the related notions in this general setting very much as in 
Definition 18.41 using the basis B{V). 

Theorem 9.1 Analogues of Theorems \8.5\ and \8.lS\ also hold for V and G as above, except for 
the statements therein concerning parallelization of the construction of a separating e.s.o.p. or 
quasi-e.s.o.p. 
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This result also generalizes to direct products of tori and classical algebraic groups. 

Proof: (Sketch): Since the proof is very similar to that for the special linear group, we only 
sketch how to handle the differences. We only sketch how to extend the proof of Theorem 18.51 
in this setting, the other case being similar. 

Let / C K[SLm] denote the ideal so that K[G] = K[SL„i\/ 1 . Let [/ be a variable m x m 
matrix. Identify K[SLm] = K[U]/{det{U) — 1). Order the entries of U row-wise. Fix a Grober 
basis for / with respect to the reverse lexicographic degree ordering on the monomials in the 
entries of U. Since m is constant such a Grobner basis can be computed in constant time. Let 
B' be the resulting basis of K[G] consisting of the standard monomials in the variable entries 
of U. Let K[G]<d C K[G] be the subspace spanned by standard monomials of total degree < d 
in the entries of U. Both K[G] and if[G]<rf are G x G-modules, with the first copy of G acting 
on the left and the second copy on the right. Let B'^^ = B' K[G]<:d be the restricted basis of 
K[G]<(i- We have the Peter-Weyl decomposition |Fuj 

K[G]=(BxVx{Gr 0Vx{G), (43) 

where A ranges over the highest weights of G. Let 

B = exBl ® Bx 

be the RS-basis of K[G], and let B^d = B D K[G]<:d be the restricted basis. The elements of B 
are indexed by the pairs (Ci??) of LS-paths dominated by (A*, A) for some A. Let vx*,x = v\0vx 
denote the highest weight vector of the G x G irreducible module Vx{G)* (g) Vx{G) C K[G]. 
With any pair (C*,??) of LS-paths dominated by (A*, A), |RSj associates a unique pair {fJ-^* , fJ-rj) 
of monomials in /j's such that the element b^*'^ of B^ (g) Bx indexed by (Cv) is (/^c* ' 
with ij,^* acting on the left and //^ on the right. 

The highest weight vectors vx*,x G B<^d satisfy the equations 

{ei,ej)vx*,x = 0, foralH,j, 

where e^'s acts on the left and e^'s on the right. Solving this system of equations, and classifying 
the solutions by weights, we get the specifications of all highest weight vectors vx*,x £ B<d 
represented in the basis B'^^. (Here we are using the fact that, for each (A*, A), the highest 
weight vector 'L'a*,a & B is unique, since K[G] contains only one copy of Vx{G)* ® Vx{G).) Since 
the action of (/j, /j)'s on B'^^ is easy to compute, the action of the monomial operators (/i^*, /i^)'s 
on the basis B'^^ is also easy to compute by induction on length. Thus we can compute the 

specifications of ^^«'^ = (/^c*' /^^)^-^*.fA's B<d efficiently. It can be shown that this can be 
done in poly((i™^) time. This yields the transition matrix T<d from B<d to B'^^. In additional 
poly((i™' ) time we can also compute its inverse T^^ . 

Since Vx{G) can be embedded in an appropriate K[G]<d via the Peter-Weyl decomposition 
([I3l) . we can carry out all calculations on Vx{G) within K[G]<:d using the bases B'^^,B<d, and 
the transition matrices T<:d and T^^. After this the analogues of Lemmas 18. 6118.71 in this general 
setting are easy to prove (except for the statements concerning parallelization) . The analogue 
of Corollary 18.141 is also easy to prove. We can not use Cayley's process now as it is specific 
to the special linear group. But the Reynold's operator on K[G] corresponds to the projection 



55 



onto the trivial G x G- module in the Peter- Weyl decomposition ([I3]) of So Rg{w), for 

any w S 5<^, can be calculated by first expressing w in the basis B<d using the matrix T^J, 
and then projecting it onto the trivial G x G-module in the Peter- Weyl decomposition (j43|) of 
K[G]<d- The rest of the proof is similar to the proof in the case of the special linear group. We 
omit the details. Q.E.D. 

The following is the conjectural generalization of Theorem 13. 131 (a) in this setting. 

Conjecture 9.2 Suppose V and G are as in Theorem \9.1[ Then, given any rational v £ V , 
the instability flag of a one-parameter subgroup driving v to the unique closed G-orbit in the 
G-orbit closure of v can be computed in time that is polynomial in n, m, and the bit-length of 
the specification of v. 

10 Equivalence 

In this section we describe the relation (Theorems 11.51 and I1.6P between a stronger form of 
black-box derandomization and derandomization of Noether's normalization lemma for explicit 
varieties. 

10.1 Stronger forms of black box derandomization of PIT 

First, we define this stronger form of black-box derandomization of PIT for small degree circuits. 

Let K be an algebraically closed field of characteristic zero and set of r 

variables. The stronger black-box derandomization problem in this context is to construct in 
poly(s) time a hitting set against all nonzero polynomials /(x) G K[x] of degree < d = 0{s°'), 
a > a constant, that can be approximated infinitesimally closely by arithmetic circuits over K 
and X of size < s. 

By infinitesimally close approximation, we mean that, given any e > 0, there exists such 
a circuit C = of size < s such that the distance ||C(a;) — /(x)||2 between the coefficient 
vectors of C{x) and f{x) in the L2-norm is less than e. The circuit C can depend on e. By 
a hitting set, we mean a set Sr,s of test inputs such that (1) the total bit-length of the 

specification of each test input is poly(s), and (2) for every nonzero f{x) of degree < d that can 
be approximated infinitesimally closely by circuits over K of size < s, Sr,s contains a test input 
b such that / 0. 

The strong black-box- derandomization hypothesis for PIT for small degree circuits is that 
there exists a poly(,s)-time-computable hitting set Sr^s- The strong black-box derandomization 
hypothesis for general PIT without any degree restriction is defined similarly. Parallel versions 
of these hypotheses are defined as in Section 12.11 A similar strong black-box derandomization 
hypothesis for SDIT (symbolic determinant identity testing), cf. Section [2.H is that, given m, 
one can construct in poly(m) time a hitting set against all nonzero homogeneous polynomials 
over K of degree m that can be approximated infinitesimally closely by symbolic determinants of 
size m. The strong black-box derandomization hypothesis for the symbolic permanent identity 
testing (SPIT) is similar, using the permanent in place of the determinant. 
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The following result is the analogue of Theorem 12.21 for strong black-box derandomization. 
This is why PIT is expected to have efficient strong black-box derandomization. 

Theorem 10.1 Suppose K is a field of characteristic zero. 

(a) Suppose there exists an exponential-time- computable multilinear polynomial p in m variables 
with integral coefficients of poly{m) bit-length such that p can not be approximated infinitesimally 
closely by arithmetic circuits over K 0/0(2"^") size for some constant a > 0. Then PIT for 
small degree circuits has 0{2P^^y^^9i^)y time- computable strong black-box derandomization. 

(b) If p cannot be approximated infinitesimally closely by arithmetic circuits over K of 0{m°') 
size for any constant a > 0, then PIT for small degree circuits has 0{2^^)-time computable strong 
black-box derandomization, for any e > 0. 

(c) Suppose there exists an exponential-time- computable multilinear polynomial p in m variables 
with integral coefficients of poly{m) bit-length such that p can not be approximately infinites- 
imally closely by arithmetic circuits over K, with oracle gates for the permanent, of 0(2™") 
size for some constant a > 0. Then SPIT has O(2^'o¥o5(«)) 

-time- computable strong black-box 

derandomization. 

(d) Suppose there is an exponential-time computable multilinear polynomial p in m variables 
with integral coefficients of poly{m) bit-length such that p can not be approximated infinitesimally 
closely by arithmetic circuits over K , with oracle gates for the permanent, ofO{m°') size for any 
constant a > 0. Then SPIT has 0{2^^)-time computable strong black-box derandomization, for 
any e > 0. 

Proof: (Sketch) The proof is similar to that of Theorem 7.7 in |KIj . Hence we only sketch how 
to modify the proof in jKIj . We only consider (a), the cases (b), (c), and (d) being similar. 

(a) We want to construct a hitting set against every nonzero polynomial f[x) of degree < d = 
0{s°'), a > a constant, that can be approximated infinitesimally closely by arithmetic circuits 
over K of size < s. Choose m = (log sY, for a large enough constant e to be fixed later. With this 
choice of m and using the hard function p{yi, . . . , ym), we get an arithmetic NW-generator NW^ 
as in the proof of Theorem 7.7 in jKIj . Let H denote the set of test inputs constructed using this 
NW-generator as in jKIj . We claim that H is a hitting set as desired. Suppose to the contrary 
that f{b) = 0, for every b € H, for some nonzero f{x) of degree < d that can be approximated 
infinitesimally closely by arithmetic circuits over K of size < s. It then follows as in the proof of 
Lemma 7.6 in |KIj . but using Theorem 12.31 instead of Lemma 7.2 in jKIj . that p{yi, ■ ■ ■ ,ym) can 
be computed by an arithmetic circuit C over K of size 0{s'^) using oracle gates for / for some 
constant c > independent of e. Given any circuit Ds of size < s for approximating / within 
precision 5 > 0, let C5 denote the circuit obtained from C by substituting for /. Since / 
can be approximated infinitesimally closely by circuits of size < s, by choosing 5 small enough, 
Cs{y) can approximate p{yi, . . . ,ym) to any precision. The size of Cs is 0(s^+^). For any 5, 
choosing e large enough, the size of Cs can be made < 2"^^ for any e > 0. This contradicts 
hardness of p. Q.E.D. 

10.2 Explicit algebraic varieties 

Next we define an explicit algebraic variety. 
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Definition 10.2 Let K he an algebraically closed field of characteristic zero. 

(a) A family {Wn}, n — oo, of affine varieties is called explicit if there exists a map tpn '■ K'^ — t- 

V= {vi,...,Vr) {fl{v),...Jm{v)), (44) 

r = poly{n), m = n^^^\ logm = O{poly{n)), each fj a homogeneous polynomial of poly(n) 
degree, and homogeneous polynomials gj{x), x = (xi, . . . ,Xn), ^ < j < m, of poly{n) degree such 
that: 

1. Wn is the closure of the image Im{Tpn) of ipn- 

2. The polynomial Fn{v,x) = fj{v)gj{x) is uniformly p- computable \Vlf - By uniform, we 
mean that one can compute in poly{n) time a poly{n)-size circuit Cn over Q that computes 
Fn{v,x). Its degree deg{Fn) is poly{n). 

3. The polynomials gj{x) 's are linearly independent. 

We call ijjn the map defining Wn, and Fn the polynomial defining Wn- We specify Wn 
succinctly by the circuit Cn- 

We say that {Wn} is strongly explicit if the circuit Cn is weakly skew. 

(b) We say that {Wn} is weakly explicit, if F{v,x) above is equal to the permanent of a matrix 
Dn of poly{n) dimension such that (1) each entry of Dn is a homogeneous bilinear form in v 
and X, and (2) the description of Dn is poly (n) -time computable. 

We say that {Wn} is positive if the coefficients of all bilinear functions that occur as the 
entries of Dn are all non-negative rational numbers. 

(c) A family of projective varieties is called explicit (strongly explicit, weakly explicit, or positive) 
if the family of the affine cones of these varieties is explicit (respectively, strongly explicit, weakly 
explicit, or positive). 

(d) A family explicit affine or projective varieties in a relaxed sense (without any degree restric- 
tion ) is defined just as in (a) and (c) hut without putting any restriction on the degrees of fj , gj 
and Fn. 

The definition of an explicit or weakly explicit variety over an algebraically closed field K of 
arbitrary characteristic is similar. 

10.2.1 Examples of explicit varieties 

(1) Explicit categorical quotients and related varieties: The variety V/G, with V and G as 
in Theorem 13.61 is a strongly explicit variety with the defining map ip = ttv/g ^ ™ (|12p : cf. 
Lemma 13.71 Explicitness of V/G is a, key ingredient in the proof of Theorem 13.61 The variety 
V/G for V and G as in Theorem 14.11 is also strongly explicit. 

By Theorem 18.81 the variety V/G, with V and G as in Theorem 18.51 is strongly explicit 
for constant m, and quasi-strongly-explicit for m = polylog(n). Explicitness of V/G is a key 
ingredient in the proof of Theorem 18.51 The variety V/ G is explicit in a relaxed sense without 
any degree requirement for any m as per Conjecture [87101 
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Let U = {Ui, . . . , Ur) and U' = {U[, . . . ,U^) be two tuples of mxm variable matrices. Think 
of them as the coordinates of V = Mm{KY . Expand Ti{X, U, U') in eq. (jl6p as 

fi{X,U,U') = Y,TaiX)iTa{U) - T^{U')) = Y,\MT[a]{X)m^^{U) - r[,](C/')), (45) 

a [a] 

where [a] = [ai ■ ■ ■ ai] ranges over the equivalence classes of all words of length / with each 
aj G [r], and |[a]| denotes the cardinality of [a]. Let y[F, G, Z], I < ni^, be the strongly explicit 
variety defined by Ti{X, U,U'), with T^^] {U) — T[a] iU')^s playing the role of /j's in Definition ll0.2l 
and |[a]|r[Q,](X)'s playing the role of gj^s. The construction of a separating e.s.o.p. for ^[1/]"^ in 
the proof of Theorem 13.61 is based on explicitness of the varieties G,/]'s. The variety V/G 
in Theorem 13.61 has explicit defining equations given by the second fundamental theorem for 
matrix invariants [Pr^ IRzj . Finding similar explicit defining (or close to defining) equations for 
Y[V, G, /]'s seems extremely hard. This is why the problem of constructing a separating e.s.o.p. 
(unconditionally) turns out to be so wild. 

(2) The Grassmanian (and G/P in general) is an explicit variety with the defining map ip being 
the well-known Pliicker map (cf. ()47p below). Explicitness of the Grassmanian is a key ingredient 
in the black-box derandomization of the Grassmanian SDIT below (Proposition 1 1 1 . 5|) . 

(3) The variety A[det,m] associated with the determinant in |MSlj (in the context of the per- 
manent vs. determinant problem) is explicit. The variety A[perm, n, m] associated with the 
permanent in [MSlj (in the same context) is weakly explicit and positive. These varieties are 
defined as follows. 

Let X be an m X m variable matrix. Let y be an n x n submatrix of X, say its lower-right 
n X n subminor. Let z be any entry of X outside Y. Let X be the vector space over C of 
homogeneous polynomials of degree m in the variable entries of X. Thus g = det(X) is an 
element of X. Then X is a representation of GL^2(C), where a G GL^2(C) maps h{X) € X to 
h(a~^X), thinking of X as an m^-vector. Let P{X) be the projective space associated with X. 
Then A[det,m] C P{X) is the closure of the orbit Gg C P{X) in the usual complex topology 
of P{X). The variety A[perm, n, m] C P{X) is constructed similarly using the homogeneous 
polynomial z™~"perm(y) £ X in place of the determinant. 

The affine cone of A[det, m] is explicit with the defining map ip : M^2{K) — )• X that maps 
V G M^2{K) to det{vX). The polynomial defining A[det,m] is (ie,t{vX). 

The affine cone of A[perm, n,m] is weakly explicit and positive. 

(4) Explicit variety associated with a p-computable polynomial: 

Let {pn{v,x)}, V = (vi, . . . , Uj.), X = (xi, . . . , Xn), be a uniform p-computable family of 
polynomials over K homogeneous in v. Let pn{v,x) = ^ ^ f ^i{v) lJi{x) , where ^ ranges over all 
monomials in x of total degree < deg(p„) = poly(n). Let m be the number of such monomials. 
Let ip = ipn he the map 

Let gfj,{x) = fJ.{x). Then Wn = Im{'ipn) is an explicit variety with the defining map ■0n and the 
defining polynomial Pn- 

(5) Explicit toric variety associated with a p-computable polynomial: 
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Let {pnix)}, X = (xi, . . . ,Xn), be a uniform p-computable homogeneous polynomial over x 
and K. Let Pn{x) = X^^a^A^l^;), where £ K and /i ranges over all monomials in x of total 
degree = deg(p„) = poly(n). Let m be the number of such monomials. Consider the monomial 
map Vn: 

ll^n-V= {vi,...,Vn) G ^ (...,a^/i(w),...) G K"^. 

Let Wn = Im{ijjri), and P(VFn) its projectivization. Then P{Wn) is an explicit toric variety, 
with the defining polynomial 

Fn{v,x) = '^a^n{v)fi{x), 
which is p-computable and uniform. 

(6) The toric variety in characteristic zero associated with the Birkoff polytope (cf. Section 6.2 
in |DSj ) is weakly explicit and positive. 

(7) We call an explicit W with dim(VF) = 1 an explicit curve. We define explicit surfaces, 
explicit three-folds, and so on, similarly. 

10.3 Implication of strong black-box derandomization for explicit varieties 

We now describe an implication of strong black-box derandomization for explicit varieties. 

Definition 10.3 Let W = Wn be an explicit variety as in Definition \10.2l zi,...,Zm the 

coordinates of , and ip* the comorphism of ip in \44^ - Note that K\W\ is graded, with 
deg(zj) = deg(/j). 

(a) We say that s G has a short specification if 'ip*{s) has a straight-line program over Q 

and vi, . . . ,Vr of 0{poly{n)) hit-length that computes the polynomial function on K^' correspond- 
ing to ip*{s). 

(h) We say that a set S C is an explicit system of parameters (e.s.o.p.) for K\W] if (1) 

each element s £ S has a short specification as in (a) and is homogeneous of poly{n) degree, 
(2) K[W] is integral over its subring generated by S, (3) the size of S is poly{n), and (4) the 
specification of S, consisting of a straight-line program for i/j*{s) for each s £ S as in (a), can 
be computed in poly[n) time. 

An s.s.o.p. is defined by dropping the condition (4). 

(c) We say that Noether's normalization lemma for the coordinate ring K[W] of W is deran- 
domized if K[W] has an e.s.o.p. 

(d) We say that s G ^^[M^] is strict if, for some 6 G of poly {n) bit-length and < c < deg(F„), 
s = Ylj ^j9ji^)' where j ranges over all indices such that deg(/j) = c. Such a strict s can be 
specified by the pair (b,c). We call an e.s.o.p. strict, if its each element is strict. A strict 
e.s.o.p. is specified by the pairs for all its elements. We say that Noether's normalization lemma 
for K[W] has strict derandomization if K[W] has a strict e.s.o.p. A strict s.s.o.p. is defined 
similarly. 

(e) Strict derandomization of a weakly explicit but positive variety ( Definition 1 1 0. 2\) is defined 
similarly. 
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(f) An s.s.o.p. or e.s.o.p. (strict or otherwise) and derandomization in a relaxed sense (without 
degree restriction) are defined similarly by dropping the degree requirement in (b) (1). 

For each s in a strict, weakly explicit, positive system S of parameters for /('[W^s] as in 
(e), is the permanent of a matrix of polynomial size each whose entries is a linear 

combination of ViS with non-negative rational coefficients. Given any rational non-negative v, 
whether ip~^{s){v) is nonzero can thus be decided in polynomial time and approximate value of 
il)~^{s){v) can also be computed efficiently by an FPRAS jJSVj . 

The following is a full statement of Theorem 11.51 (3). 

Theorem 10.4 Let K be an algebraically closed field of characteristic zero. 

(a) Noether's normalization lemma for the coordinate ring K[W] of an explicit variety W has 
strict derandomization if PIT for small degree circuits over K has strong black box derandom- 
ization. (Analogous statement also holds in a relaxed sense without any degree restriction on the 
explicit variety and PIT.) 

(b) Noether's normalization lemma for the coordinate ring of the explicit variety A[det,m] (cf. 
Example 3 in Section \10.2.1]) has strict derandomization if SDIT has strong black box deran- 
domization. The straight-line programs of an e.s.o.p. here can also be assumed to be weakly 
skew. 

(c) Noether's normalization lemma for the coordinate ring K\W] of a weakly explicit but positive 
variety W has strict derandomization if SPIT has strong black box derandomization. 

(d) The e.s.o.p. S constructed in (a), (b), or (c) can also be assumed to be separating. 

Here we say that S is separating if for any two distinct points u,v G W there exists an 
s £ S such that s{u) ^ s{v). We say that Noether's Normalization Lemma for can be 

derandomized in a strong form if has a separating e.s.o.p. 

Analogous result holds over an algebraically closed field of any characteristic. 

Proof: We will only prove (a) and (b), (c) being similar, and (d) being a simple extension of (a), 
(b), and (c). 

(a) Let W = Wn be an explicit variety as in Definition 110.21 and C„ the circuit computing Fn 
as there. Let s = poly(n) be its size, and d = poly(n) its degree. 

By the strong black-box derandomization hypothesis, there exists a poly(s)-time computable 
hitting set T against all nonzero polynomials h{x) = h{xi, . . . ,Xn) of degree < d that can be 
approximated infinitesimally closely by arithmetic circuits over K of size < s. 

For each b £ T and < c < deg(F„), define h^^ciz) := Ylj ^j9ji^) ^ -^[^]; where j ranges 
over all indices such that deg(/j) = c, and z = {zi, . . . , Zm) denote the coordinates of K^. Then 
deg{hb^c) = c. Let 

S = {h,c{z) I 6 G T, < c < deg(F„)} C K[W]. (46) 

Since each element in S is clearly strict, it suffices to show that S is an e.s.o.p.; cf. Defini- 
tion [1031 This follows from Lemma 1 1 . 5 1 b elow . 

(b) Use Lemma 110.61 instead of Lemma llO.51 Q.E.D. 
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Lemma 10.5 Suppose W is an explicit variety and PIT for small degree circuits has strong 
black box derandomization. Let S and T he as in Then: 

(a) W D Z(S) = {0}, where Z{S) C is the zero set of S, and denotes the origin in . 
(h) The coordinate ring K\W] is integral over the subring generated by S. 
(c) The set S is an e.s.o.p. 

Proof: Let tp = tp^^ fj, Oj, and F = Fn{v,x) be as in Definition 110.21 

(a) Consider any nonzero point w = {wi, . . . , Wm) E C K™. We have to show that hh^dw) ^ 
for some b £ T and < c < deg(F„). Let Fw{x) = Wjgj{x). Recall that is graded, with 
deg(2;j) = deg(/j). Let F^(x)c denote the degree c component of Fw{x). Then F^(6)c = hb,c{w). 
So we have to show that F^{b)c 7^ for some b £T and < c < deg(i^„). 

Since W = Im(v^), there exists, for any S > 0, an hs £ such that ||V'(/i<5) — u^lb < 5/2""" ; 
for some large enough positive constant c to be chosen later. Taking 5 to be small enough, we 
can assume that '4'{hs) 7^ 0. Since ipihs) = {fi{hs) fm{h&)) and W is explicit, it follows 
from Definition 110.21 (a) that Fn{hg,x) = fj{hs)gj{x) is not an identically zero polynomial 
in X. Let Cn be the circuit computing Fn{v,x) as in Definition 110.21 Let Cn,6 be the circuit 
obtained from Cn by specializing v to hg. Then the size of Cn,s is s = size(Cri) = poly(n) and 
the degree is d = deg(C„) = poly(n). Furthermore, 

||Cn,5(x)-F^(x)||2 = \\J2ifjihs)-Wj)g,{x)\\2<2^^mhs)-w\\2<S, 

j 

for some large enough positive constant c. Since, 6 can be made arbitrarily small, it follows that 
Fyj[x) can be approximated by circuits of degree < d and size < s. Since T is a hitting set, there 
exists b £ T such that Fw{b) 7^ 0. Hence Fw{b)c 7^ for some c < deg(F„). This proves (a). 

(b) By (a) and Hilbert's Nullstellansatz [E], it follows that, given any t £ K[W],t'- belongs to 
the ideal (S) in generated by S for some large enough positive integer I. Since is 
graded, it now follows from the graded Noether's normalization lemma (Lemma 12.101 (a)) that 

is integral over its subring generated by S. This proves (b). 

(c) We have to verify the properties (l)-(4) in Definition 110.31 (b). 

(1) We have to show that each hh^dz) £ S has a short specification. We have ip* (hb^div) = 
Fn{v,b)c- Since W is explicit, cf. Definition 110. 2( we can compute the description of the circuit 
Cn over Q computing Fn in poly(n) time. Hence the total size of Cn, including the bit-lengths of 
the constants in it, is poly(n). The circuit Cn;b for computing Fn{v, b) is obtained by instantiating 
the circuit C„ at x = 6. Hence its total size (including the bit-lengths of the constants) is poly(n). 
Using Van-der-Monde interpolation as in |Strlj (cf. also the proof of Lemma 18.151 where this 
technique was used), we can construct using Cn-t a circuit Cn;b;c of poly(n) size for computing 
ip*{hb^c){v) = Fn{v, b)c- Thus Cn;b;c Can be constructed in poly(n) time. It has poly(n) bit-length 
and poly(n) degree. This specification of Cn-b;c can be converted into a straight-line program of 
poly(n) bit-length. This shows that each hb^c{z) has a short specification. 

(2) It follows from (b) that is integral over the subring generated by S. 

(3) Since the size of T is poly(s) = poly(n), and deg(F„) is poly(n), the size of S is clearly 
poly(n). 
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(4) We saw above that the specification of each circuit Cn-b;c computing iIj* {hf,^c) can be computed 
in poly(n) time. Hence it follows that the specification of 5, consisting of a circuit Cn-b;c 
computing ^*(/ife,c) for each /ib,c S S, can be computed in poly(n) time. 

This shows that S is an e.s.o.p. Q.E.D. 

Lemma 10.6 Suppose A[det,m] is the explicit variety as in Example 3 in Section \lU.2.1[ As- 
sume that SDIT has strong black box derandomization. 

Then the set S in o-iT' ^-S-o.p. for the coordinate ring o/ A[det, m]. The straight-line 

programs of the elements in S can also be assumed to he weakly skew. 

Proof: The proof is just like that of Lemma llU. 51 We only observe that in the case of A[det, m] 
the defining polynomial Fn{v, x) in Definition 110.21 is the determinant of a matrix M„ of poly(n) 
size whose each entry is a bilinear function in v and x; cf. Example 3 in Section [10.21 Further- 
more, the specification of Mn can be computed in poly(n) time. Hence we can use det(M„) in 
place of Cn in the proof of Lemma llU.51 Then we can use SDIT in place of PIT for small degree 
circuits. The straight-line programs of the elements in S can be assumed to be weakly skew 
since the determinant has a weakly skew straight-line program |MP] . Q.E.D. 

The following is a full statement of Theorem 11.51 (1) and (2). 

Theorem 10.7 Let K he an algebraically closed field of characteristic zero. Let W = Wn be an 
explicit variety. Then: 

(a) An s.s.o.p. exists for K[Wn]. 

(h) The problem of constructing an s.s.o.p. for K\Wr^ belongs to P SPACE. This means an 
s.s.o.p. can be constructed in poly{n) work-space, given the circuit Cn specifying Wn as in 
Definition \10.Si 

(c) The problem belongs to C PL! assuming GRH. 

(d) The problem of constructing an h.s.o.p. for K\Wr,\ belongs to EXPSPACE. 

(e) It belongs to REXP^^ if the second fundamental theorem (SET) holds for K\Wn\ as in 
Definition \10.^ below. 

Analogous statement also holds for an explicit variety in a relaxed sense without any degree 
restriction. 

Definition 10.8 We say that SET (Second Fundamental Theorem) holds for an explicit family 
{Wn} of varieties if the ideal ofWn ^ -f^"* has a set Q of generators such that (1) each generator 
in Q has a straight-line program over Q and the coordinates zi,...,Zm of 

of 0{2Poly{n)^ 

bit-length, and (2) the specification of Q consisting of straight-line programs for its elements can 
be computed in time and poly{n) work-space. 

The size of the straight-line programs in (1) is clearly Vt[m). This size is exponential in n 
if m is exponential in n, as would be the case in the intended applications. Hence the poly(n) 
work-space restriction in (2) is an essentially optimal uniformity condition. For this reason, we 
say that {M^n} has explicit defining equations if SFT holds for {Wn}- 



63 



Proof of Theorem \10.1 



(a) Analogue of Theorem 12.11 also holds for strong black-box derandomization. Specifically, the 
hitting set B in Theorem 12.11 is also a hitting set against all non-zero polynomials that can be 
approximated infinitesimally closely by arithmetic circuits over K and r variables of size < s 
and degree < d; cf. Theorem 4.4 in |HSj . Hence the proof of (a) is very similar to that of 
Theorem 13.91 and 18.181 (c). Specifically, we now use the hitting set B given by Theorem 12.11 
(with appropriate parameters) in place of the hitting set T used in the proof of Theorem 110.41 

(a) . Since the new hitting set B cannot be computed efficiently, what we get now is an s.s.o.p. 
instead of an e.s.o.p. 

(b) and (c): The proof of (a) can now be constructivized very much as in the proof of Theo- 
rem 13.111 So we only give a sketch. 

Let Fn{v, x) and C„, be as in Definition 110.21 Let x)c denote the degree c part (in v) of 
Fn{v,x). We obtain circuits Cn,cS for Fn{v,x)cS from Cn using interpolation as in the proof of 
Lemma llO.51 (c). Given any potential hitting set T, let S and /if,^c(-2)'s be defined as in eq. (I46p . 
Let 

A = i;~\S) = {i/r\h4z)) = Fn{v,b), I 6 G r,0 < c < deg(F„)}. 

It follows from the proof of Theorem 110.41 (a) that the set S is an s.s.o.p. iff every fj{v) 
vanishes on the zero set Z{A) of A. This is so iff every element oi B = {Fn{v, b')c \ b' G [d+ 1]"}, 
d = deg{Fn), vanishes on Z{A). This last test can be done in poly(n) work space using the 
PSPACE-algorithm for Hilbert's Nullstellansatz |Kol IKol] . Assuming GRH, it can be done by 
a n2-algorithm using Theorem 12. 141 Now just guess T and carry out this test. By the proof of 
(a), a correct guess exists. 

(d) The proof is similar to that Proposition 18.21 

(e) The proof is similar to that of Proposition 13.11 using the assumed SFT in place of Theo- 
rem EH Q.E.D. 

This proof also yields: 

Theorem 10.9 The problem of strong black-box derandomization of PIT belongs to PSPACE 
unconditionally and to S3 assuming GRH. 



10.4 Equivalence 

The following is a precise statement of Theorem 11.61 

Theorem 10.10 (Equivalence) Let K be an algebraically closed field of characteristic zero. 

(a) The strong black box derandomization of symbolic determinant identity testing (SDIT) over 
K is equivalent to strict derandomization of Noether's normalization lemma for A[det,m]. 

(b) The strong black-box derandomization for SPIT is equivalent to strict derandomization of 
Noether's normalization lemma for A[perm,m,m] (cf. Example 3 in Section \l0.2.1\) . 

(c) The strong black-box derandomization of general PIT over K is equivalent to strict deran- 
domization of Noether's normalization lemma for the orbit closure of the P-complete function 
II(Y) defined in Section 6 of \MS1^ (and denoted as A[H{Y)] there). 
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(d) The similarly defined strong black-box derandomization of PIT for depth three circuits over 
K and n variables with degree < d and top fan-in < k is equivalent to strict derandomization 
of Noether's Normalization Lemma for the k-th secant variety X{d,k,n) of the Chow variety 
defined below. 

Let be the space of degree d homogeneous forms in n variables, and P{S^) the associated 
projective space. The variety X{d,k,n) C here is, by definition |Llj . the projective 

closure of the set of polynomials that can be expressed as sum of k terms, each term a product 
of d linear forms. 

Proof: We shall only prove (a), the proof of (b), (c), and (d) being similar. 

Strong black-box derandomization of SDIT implies strict derandomization of Noether's nor- 
malization lemma for A[det,m] by Theorem 110.41 (b). 

Conversely, suppose Noether's normalization lemma for A[det,m] C P{X) has strict de- 
randomization. (Here X, as in Example 3 in Section 110.2. H is the vector space over K of 
homogeneous polynomials of degree m in the entries of an m x m variable matrix Y.) This 
means the homogeneous coordinate ring i2[det,m] of A[det,m] has a strict e.s.o.p. S. Let 
/[det,m] be the ideal of A[det,m] so that i?[det,m] = K[X]/I[det,m]. 

Let Zq's, where a ranges over the monomials in the entries of Y of degree m, be the coordi- 
nates of X. Thus each homogeneous form h{Y) of degree m can be written as Za{h)a{Y), 
where Za{h) G K denote the coordinates of h G X. If h{Y) is non-zero, we also think of it as a 
point in P{X). 

Since 5 is strict (cf . Definition 110. 3p , each element of S is of the form 



for some m x m matrix b G of poly(n) bit-length, and the specification of S specifies each 
such b. Let B = {b \ Sb € S}. Its bit-size is clearly poly(?i). It is poly(n)-time computable since 
S is. 

So we only have to prove that B is a, hitting set against symbolic determinants of size m. 

Since S is an e.s.o.p., i?[det,m] is integral over the subring generated by S. Hence each Zc^ 
satisfies a monic polynomial equation of the form: 



where each aj is a non-constant homogeneous polynomial in the elements of S. It follows that 
every element in S cannot vanish at any given (nonzero) h = h{Y) G A[det,m]. Otherwise, 
every Zq. would vanish at h, and hence, h would be identically zero. 

Now suppose a nonzero polynomial h = h{Y) of degree m can be approximated infinitesimally 
closely by expressions of the form det (!"'), where y' is an m x m matrix whose each entry is a 
homogeneous linear form in the entries of Y with coefficients in K. By the definition of A[det, m] 
(cf. Example 3 in Section Il0.2.ip . it follows that h{Y) considered as a point in P{X) lies in 
A[det,m]. Since h{Y) is not identically zero, it follows from the above argument that some 
Sh £ S does not vanish on h; i.e., h{b) ^ 0. This means B is a hitting set against every nonzero 




a 



z\ -I- ak~\z\ ^ H h ao = 0, mod /[det, m] 
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polynomial h{Y) that can be approximately infinitesimally closely by symbolic determinants of 
size m. In other words, SDIT has strong black-box derandomization. Q.E.D. 

11 An approach to derandomization 

Theorem IIU.IUI suggests a natural approach to black-box derandomization of PIT via deran- 
domization of Noether's Normalization Lemma for the associated variety. In this section we 
formulate a few conjectures which may be helpful in this context. 

Let W C be an explicit variety as in Definition 110.21 Theorem 110.41 leads to: 

Conjecture 11.1 Noether's normalization lemma for the coordinate ring K[W] of any explicit 
variety W can be derandomized in a strict form (cf. Definition \10.3\) . 

When W = A[det, m], this implies black-box derandomization of SDIT (Theorem 1 1 . 1 p . For 
black-box derandomization of PIT (without degree restrictions), let W be the variety associated 
with the P-complete function H{X) in jMSlj instead of A[det,m]. 

11.1 An explicit Grobner basis 

We now formulate the notion of an explicit Grobner basis which may be a helpful for proving 
this conjecture. 

Let y = (yi, . . . ,ym) be any explicit homogeneous system of coordinates for iT™, possibly 
different from z = {zi,...,Zm) above. By explicit, we mean that each ip*{yj) has a short 
specification as a straight-line program (as in Definition 110.31 (a)) that can be computed in 
poly(n) time. Let I{W) C K[y] be the ideal of W. Let -< be the reverse lexicographic term 
order on the monomials in y/s, with yi < y2 ~< ■ ■ ■ ~< y-m- Let I^{W) be the monomial 
ideal generated by the initial monomials (with respect to -<) of the polynomials in I(W). The 
monomials not in I^{W) are called standard. Let GB{W) C I{W) be a Grobner basis |Stm2j 
of I(W) with respect to the term order This means the initial monomials of the polynomials 
in GB{W) generate mW). 

Definition 11.2 We say that the Grobner basis GB{W) is explicit if (1) its specification con- 
sisting of a straight-line program for every g{yi, . . . ,ym) S GB{W) can be computed in poly{n) 
work space, (2) every standard monomial involves only poly{n) yj's, (3) given a monomial 
in yj's in a sparse representation (only nonzero exponents are specified in binary), whether /x 
is standard or not can be decided in time that is polynomial in n and the total bit-length of the 
sparse representation of fj,, and (4) for any d, whether there exists a standard monomial of degree 
d can be decided in poly{{d) ,n) time, where {d) denotes the bit-length of d. 

By (2) and (3), the Hilbert function h\Y{d), the number of standard monomials of degree d, 
is a ^P- function, and by (4), the problem of deciding if it is non-zero belongs to P. The size of 
the straight-line programs in (1) is Q,{m). This is exponential in n if m is exponential in n, as 
would be the case in the intended applications. Hence the poly(n) work-space requirement in (1) 
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is essentially optimal. In contrast, the work-space requirement of the Grobner basis algorithms 
|MR2] for general varieties is exponential in n. 

An explicit Grobner basis exists for (the ideal of) the Grassmanian by the straightening 
algorithm in Chapter 3 of |Stm2] ). for Schubert varieties by the straightening algorithm in 
|ReSj ■ for G/P and their Schubert subvarieties by the straightening algorithm in |GLj . and 
for the explicit toric variety associated with the Birkoff polytope by Theorem 6.1 in |DS] and 
Theorem 3.1 in |Stml] . 

Conjecture 11.3 Let W be any explicit variety in Theorem \ 10.1 (A Then I{W) has an explicit 
Grobner basis. 

A plausible approach to black-box derandomization of SDIT is to (1) first construct an 
explicit Grobner basis for the ideal /[det,m] of A[det,m], and (2) use it derandomize Noether's 
Normalization Lemma for A[det,m] (in a strict form), the story for general PIT and PIT for 
depth three circuits being similar. The step (2) depends on (1), and nothing can be said about 
it at present since the step (1) itself seems extremely difficult. Hence it would be interesting 
to carry out these steps as far as possible for easier explicit varieties first, such as V/G in 
Theorem 11.11 explicit toric varieties, explicit quiver varieties, explicit curves, surfaces, and so 
on. 

11.2 Derandomization of the Grassmanian 

With this in mind, we now carry out these steps for one of the simplest explicit varieties-the 
Grassmanian. The results in this section can also be extended to other determinantal varieties 
such as G/P and their Schubert sub- varieties. But we focus on the Grassmanian since this 
illustrates all the basic ideas. 

Let G^ be the Grassmanian of A;-dimensional subspaces in K" . Any A £ can be identified 
with a kxn matrix over K whose rows correspond to k basis vectors in A. This matrix depends 
on the basis, but this choice does not matter in what follows. Hence, by abuse of notation, 
we denote any such matrix by A again. For any 1 < ii < i2 <■■■< ik ^ n, let A[ii, . . . ,ik] 
denote the k x k minor of A involving the columns ii, . . . ,ii.. Then A[ii, . . . , i^] are the Pliicker 
coordinates of A |Fu] . Consider the Pliicker map : K^^ — )• K*, i = (^), given by 

V^: (...,^[ii,...,ifc],...) Ci^*. (47) 

It is well-known (cf. pages 227 and 384 in [FuJ) that the image of this map is closed, and 
can be identified with the projectivization of this closed image Im(?/;). Let z = (. . . , . . .) 

denote the Pliicker coordinates of K*, so that if)* {zi^^,,,,ij,){A) = . . . ,ifc]. Let / denote the 
ideal of = Im('i/;). Then K[z\/I can be identified with the homogeneous coordinate ring 
A'[G^] of the Grassmanian. It is easy to see that the Grassmanian is an explicit variety (cf. 
Definition 110. 2p with the defining Pliicker map (|47p . 

Given any A E G^ and any n — A;-dimensional subspace L in K^^ fix any {n — k) x n- matrix 
representing L (which we denote by L again), and let [A; L] denote the nxn matrix whose first 
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k rows are formed by A and the last {n — k) rows are formed by L. By Laplace expansion, 

det[^;L]= ^ A[ii, . . . ,ik]L[ii, . . . 

[il,-,ik] 

where L[ii, . . . , i^] denotes the cofactor of A[ii, . . . , i^]. This is formed by taking the {n — k) x 
(n — k) minor of L formed by the columns with indices different than ii, . . . , ifc and multiplying 
it by an appropriate sign, which we denote by sgn[ii, . . . , i^]. 

Define 

ZL= L[h,---M^h,...,,^K[Gll (48) 

so that for any A G G^, 

zi^{A) = det[^;L]. 

Ignoring a constant factor, zl depends only on L and not on the choice of the matrix representing 
L. 

We say that Noether's normalization lemma for Er[G^] is derandomized in a strict form if 
there exists a poly(n)-time computable set S = {Li, . . . ,L;} of (n — A;)-dimensional subspaces 
in K"^ such that /^[G^] is integral over the subring generated by S' = {^l^, . . . , zl^}. 

Proposition 11.4 Noether's Normalization Lemma for KlC^^] can be derandomized in a strict 
form. 

We also note down a consequence of this result in the context of black-box derandomization 
of the symbolic determinant identity testing (SDIT) of the Grassmanian type. 

By a Grassmanian determinantal projection of degree n over K we mean the determinant of 
an n X 71 matrix A such that (1) for some 1 < k < n, the (i, j)-th entry of A, for i < k,l < j < n, 
is a variable Xij, and (2) every other entry of A is an arbitrary nonuniform constant from the 
base field K. The Grassmanian SDIT is the problem of deciding if det(^) is identically zero. 

The non-black-box version of this problem is easy: det(^) ^ iff the rank of the submatrix 
of A consisting of rows higher than k is n — k. This can be checked fast in parallel over any 
field |Mul| . We call the SDIT above of the Grassmanian type, because its black-box version is 
equivalent to (strict) derandomization of Noether's normalization lemma for the Grassmanian 
Gl (cf. Lemma ELU). 

The black-box derandomization problem for SDIT of the Grassmanian type is solved opti- 
mally in the following result. As was pointed out to us after this article was written, this has also 
been proved independently in Forbes and Shpilka |FSj with a different proof technique based on 
Gabizon and Raz |GR] but essentially the same hitting set. 

Proposition 11.5 Let K he any field that contains at least distinct elements; e.g. Q, C, or 
a finite field Fp of cardinality at least . Then there exists an explicit poly(n)-time computable 
hitting set against the Grassmanian determinantal projections of degree n over K . 

Equivalently, there exists an explicit poly{n)-time computable set S = {Li, . . . ,Li} , I = 
poly{n), of {n — h)- dimensional subspaces in such that for any k-dimensional subspace A C 
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K^, there exists an Lt £ S such that LfCiA is the origin. AH subspaces are assumed to contain 
the origin. 

The cardinality of S here is optimal (equal to dim{K[G'^])). 

More strongly, S can be computed by a uniform AC^ circuit of poly{n) bit-size. 

We call the set S in this theorem an explicit hitting set against all A;-dimensional subspaces of 
K^. The optimality is possible here because i^[G^] is much simpler than /ir[y]'^ in Theorem ll.il 
orfOl 

We shall show that the following explicit set S" is a hitting set. Let / = k{n — k). Let \j G 
1 < J < J^, be distinct elements of poly(n) bit-lengths, and aj G -fC, 1 < t < I, distinct nonzero 
elements with poly(n) bit-lengths. Let Lj, 1 < t < /, be the (n — A;)-subspace of spanned by 
the rows of the {n — k) x n matrix whose (r, s)-th entry is af A^. Let 

S = {Li,...,Li}. (49) 

Clearly S can be computed in poly(n) time, and also by a uniform AC^ circuit of poly(n) bit-size. 
Hence it is explicit. 

If K is not algebraically closed, replace it by its algebraic closure. Certainly S is a hitting 
set over K if it is a hitting set over its algebraic closure. Hence, without loss of generality, we 
can assume that K is algebraically closed. 

Proposition 111.51 follows from Proposition II 1.4l in view of the following equivalence. 

Lemma 11.6 (Equivalence) Black-box derandomization of the Grassmanian SDIT is equiva- 
lent to derandomization of Noether's normalization lemma for K[G^] in a strict form. Specif- 
ically, a set S = {Li, . . . ,Li} of (n — k)- dimensional subspaces in K"^ is a hitting set against 
all k-dimensional subspaces of iff K[G'^] is integral over the subring generated by S' = 
{ZL^,. . . ,z^}. 

Proof: Suppose S = {Li, . . . ,L/} is a hitting set against all /c-dimensional subspaces of K^. 
This means for every fc-dimensional space A £ G^ there exists Lj, j < I, such that zl.{A) = 
det[A; Lj] / 0. Since the image of the Pliicker map (cf. (jl7|l ) is closed and coincides with G^, 
it follows that G^ n Z{S') = {0}, where G^ is embedded in as in (gT]), Z{S') C denotes 
the zero set of «S", and denotes the origin in K^. Now it follows from Hilbert's Nullstellansatz 
and the graded Noether's Normalization Lemma (cf. Lemma l2.10p that i^[G'5!] is integral over 
the subring generated by S' . 

Conversely, suppose i^iC^] is integral over the subring generated by S' = {zi^, . . . , z^}. 
Then we claim that S is a hitting set against all /c-dimensional subspaces A G G^. Suppose 
to the contrary that there exists an A G such that det[74;Lf] = for sll 1 < I < t. Then 
ZLt{A) = det[A; Lt] = for all t. Since each ^^ii,...,^^. is integral over the subring generated by S", 
it satisfies a monic equation of the form 

+ ar-i<i"^.,i, + • • • + + ao = 0, 

where each Oj is a non-constant homogeneous polynomial in the ring generated by S' . Hence, 
each Oj vanishes at A. It follows that each vanishes at A. That is, every Pliicker 
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coordinate A[ii, . . . of A is zero, and hence the rank of A is less than k; a contradiction. 
Q.E.D. 

Let S = {Li, . . . ,Li},l = k{n — k), be as in (j39|) and let S' = {zl^ , . . . ,z^}. Proposition [TT31 
follows from the following result. 

Lemma 11.7 The coordinate ring KlG^] of the Grassmanian is integral over the subring 
generated by S' . 

Since / = dim(K[G^]), this means S' is also an h.s.o.p. of A'[G'^]. 

The proof of this result is based on the existence of an explicit Grobner basis for the ideal 
of the Grassmanian (cf . Chapter 3 in |Stm2j ) , which implies that its coordinate ring is a Hodge 
algebra |DEP2j . As such it illustrates an approach to derandomization in Section 111.11 based on 
explicit Grobner basis in the simplest setting. 

Let P denote the set of Pliicker coordinates of G^. Make it a poset by setting Zi^,...,^^. < 
Zj-i^^...jf, iff iq < jq for all q. The height of P, i.e., the length of a longest chain in P, coincides 
with / = dim.{K[Gl]) = k{n - k). It is known |DEP2j that K\Gl] is a Hodge algebra over P 
(this is a consequence of an explicit Grobner basis for the ideal of |Stm2| ). This means: 

(1) The standard monomials in the Pliicker coordinates form a basis of Here we call 
a monomial in the Pliicker coordinates standard if it is of the form Z;i ,i za a ■ ■ ■ , where 

1 ' * * *' fe 1 ' ■ ' ■ ' fc 

[i},...,4]<[if,...,i2]<.... 

(2) If ^ is a nonstandard monomial in the Pliicker coordinates and 

3 

is its unique expression as a linear combination of distinct standard monomials ^j's, then for 
each Pliicker coordinate Zi-^^,,,^i^ that divides ^ and for each ^Uj, there exists a Pliicker coordinate 
Zj/ , that divides and satisfies < -Zn,...,^^- 

We will deduce Lemma 111.71 from the following result based on the Hodge algebra structure 
ofK[Gl]. 

Theorem 11.8 (DeConcini, Eisenbud, Procesi) (cf. Theorem 6.3. in ]DEP^ ) For any c, 

1 < c <l = k{n — k), let 

Zc= ^ a{a)zi^,,„,i^, 

where [ii, . . . , ik] ranges over all Pliicker indices of height c in the poset P, and each a{a) is any 
nonzero element of K . Then {zc} forms an h.s.o.p. of KlG"^]. 

Here the height of a Pliicker index [ii, . . . is X]j=i(b' ~ •?)■ jDEP2j . this result is stated 
with each a{a) = 1. The proof for general nonzero a(a)'s is very similar and hence is omitted. 
When each a(a) = 1 as in |DEP2j . z^s are P*^-computable. In contrast, the elements of S' in 
Lemma lll.7l are P-computable and even A^C-computable. 

Proof of Lemma lll.7t 
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Let X G be a generic fc-dimensional subspace. For any Lj, t <l, let Yt = [X; Lt\. Identify 
the Pliicker coordinate Zi^^,,,^i^ with X[ii, . . . ,ik\. Thus 

ZL,=dei{Yt)= J2 L[h,...,ik]X[h,...,ik](^K[G',:], (50) 

[ii,--;ik] 

and 

S' = {det{Yt)}. (51) 

Let A(yi, . . . , y,.) denote the determinant of the r x r Van-der-Monde matrix whose (i, j)-th 
entry is y*-. For our specific choice of Lt, 

Lt[ii, ...,ik]= aj^'^'sgnih, . . . , ifc]A(Aji, . . . , Aj^_J = af'=^'~^^=^'''sgn[ii, . . . , ifc]A(Aji, . . . , Xj„_J, 

(52) 

where 1 < ji < • • • < Jn-fc ^ n are the indices that form the complement of ii < • • • < ifc. 
For 1 < c < / = A:(n - k), let 

Zc = ^ sgn[ii,...,ifc]A(Aj,,...,Aj,^_jX[ii,...,ifc], 

[«lv,jfc] 

where ranges over the Pliicker indices of height c, and [ji, ■ ■ ■ , jn-k] denotes the 

complement of [h, . . . , ik] in [1, . . . , n]. 

By (f52|) . for any Pliicker index [ii, . . . , ik] of height c, 

. . . , ifc] = aF"=^ '^-^^'=^ '^"'sgn[ii, . . . , 4] A(A,-„ . . . , A,„_ J 

n(n+l)/2-fc(fc+l)/2 _c r- • ia^x \ \ 

= a* ^ a* sgn[^l,...,^fc]A(AJl,...,AJ„_J. 

Hence, by eq. ([^n|) . for any I < t < I = k{n — k), 

z,, = det(y,) = ^^^c^+D/^-^e^+D/^ ^ 

c=l 

Let Y be the vector whose t-th entry is det{Yt), and z a vector whose c-th entry is Zc- Then 

Y = af''^^^^'^~''^^^'^^^'^Az, 

where A is the Van-der-Monde matrix whose (t, c)-th entry is 1/af . 
It follows that 

_^^kik+l)/2-n(n+l)/2^.^y^ (53) 

Thus ZcS are linear combinations of det(Yt)'s. Since the set {zc} is an h.s.o.p by Theo- 
rem [TlHI it follows that S' = {det(yt)} is also an h.s.o.p. This proves Lemma 1 11. 71 Q.E.D. 
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